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(57) Abstract 

In order to identify genes whose proteins are involved in the regulation of SAR downstream of cell death, a screen was carried 
out to isolate mutants that constitutively express SAR. The PR- J promoter, the most reliable marker for the onset of SAR, was cloned 
in front of the firefly (Photinus pyralis) luciferase reporter gene and transformed into Arabidopsis. A transgenic line with a luciferase 
expression pattern that paralleled expression of the endogenous PR- J gene was identified and subjected to EMS mutagenesis to isolate 
mutants that constitutively expressed the P#-//luciferase gene. Biochemical, cytological, pathological, and genetic methods were employed 
to further characterize the mutants and to prove the isolation of SAR mutants downstream of cell death. This characterization enables 
one to distinguish between classes of cim mutants, and to describe physiological changes that occur during the maintenance phase of 
SAR, Furthermore, utilizing microarray chips, the entire plant genome was simultaneously surveyed for genes that change in expression 
in response to biotic and abiotic factors. By comparing gene expression changes across various treatments, groups of co-regulated genes 
(regulons) were identified and the genomic sequences of genes within a regulon were examined to identify common sequence motifs that 
are likely to act as regulatory elements. This approach used experimental design based upon the biology of the study system in combination 
with bioinformatics to analyze the results. 
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IDENTIFICATION OF DNA CONTROL ELEMENTS RESPONSIVE TO SPECIFIC 

STIMULI 

FIELD OF THE INVENTION 

The invention generally relates to the use of gene expression profiling to identify groups 
of genes that show a similar pattern of expression in response to biotic and abiotic stimuli. 
The invention more particularly relates to the use of the expression profile data thus generated 
to uncover gene groups that are co-regulated and to identify common DNA sequences that 
function to regulate gene expression in response to specific stimuli. 

BACKGROUND OF THE INVENTION 

The SAR Signal Transduction Cascade: 

Plants are constantly challenged by a wide variety of pathogenic organisms including 
viruses, bacteria, fungi, and nematodes. Crop plants are particularly vulnerable because they 
are usually grown as genetically-uniform monocultures; when disease strikes, losses can be 
severe. However, most plants have their own innate mechanisms of defense against 
pathogenic organisms. Natural variation for resistance to plant pathogens has been identified 
by plant breeders and pathologists and bred into many crop plants. These natural disease 
resistance genes often provide high levels of resistance to or immunity against pathogens. 

Systemic acquired resistance (SAR) is one component of the complex system plants use 
to defend themselves from pathogens (Hunt andRyals, 1996; Ryals et al 9 1996). See also, 
U.S. Patent No. 5,614,395. SAR is a particularly important aspect of plant-pathogen 
responses because it is a pathogen-inducible, systemic resistance against a broad spectrum of 
infectious agents, including viruses, bacteria, and fungi. When the SAR signal transduction 
pathway is blocked, plants become more susceptible to pathogens that normally cause disease, 
and they also become susceptible to some infectious agents that would not normally cause 
disease (Gaffhey et al, 1993; Delaney et al 9 1994; Delaney et aL, 1995; Delaney, 1997; Bi et 
al.> 1995; Mauch-Mani and Slusarenko, 1996). These observations indicate that the SAR 
signal transduction pathway is critical for maintaining plant health. 
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Induction OfSAR: 

SAR is triggered by certain necrosis, as found as part of disease symptoms or the 
hypersensitive response (HR). HR is a genetically programmed cell death process that is 
induced in many incompatible plant-pathogen interactions. It is also simulated in certain 
lesion mimic mutants in absence of pathogens (Dangl et aL, 1996). These mutants, called hd 
(lesions simulating disease resistance), acd (accelerated cell death) or cpr (constitutive PR 
genes) are in many cases also constitutively activated for SAR (Bowling et aL, 1994; Dietrich 
et al, 1994; Greenberg and Ausubel, 1993). Lesion mimic phenotypes can be caused by 
mutations in R genes, as seen in the Rpl mutant in maize (Hu et aL, 1996) but they can also be 
caused by metabolic perturbations (Dangl et aL, 1 996), and loss-of-fimction mutations in 
putative transcription factors (e.g. Isdl; Dietrich et aL, 1997). Mutants deficient for R gene- 
specific signal transduction provide another link between the induction of HR and SAR. The 
mutant ndrl (non-race specific disease resistance), which was isolated in a screen for 
susceptibility to avirulent Pseudomonas syringae strains, is deficient for the induction of a 
local defense reaction induced by J? genes of the LZ-NBS-LRR class (Aarts et aL, 1998; 
Century et aL 9 1995). Subsequently, SAR cannot be induced in these interactions in the ndrl 
mutant. Similarly, in the edsl (enhanced disease susceptibility) mutant, the signaling cascade 
induced by R genes containing an N-terminal TIR domain is interrupted (Aarts et aL, 1 998; 
Parker et aL, 1996). As a consequence, in interactions involving TIR-NBS-LRR R genes, no 
SAR is induced in edsL In these two mutants, the block in the induction of the SAR response 
may actually reside in the suppression of the formation of an HR rather than in a block of the 
SAR signaling cascade itself (Table 1 summarizes information on Arabidopsis mutants in 
disease resistance). 
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Table 1: Disease Resistance Mutants mArabidopsis thaliana (adapted from Maleck and 
Lawton, 1998) 



Mutant 


Screen and possible function 


Reference 


Avr-R gene interaction 






edsl (enhanced disease 
susceptibility) 

ndrl (non-race-specific disease 
resistance) 

Isdl, 6 (lesions simulating 
disease resistance) 

acdl (accelerated cell death) 
cell death 


Susceptibility to avirulent 
Peronospora parasitica isolates, 
member of the converging TTR-NBS- 
LRR R gene signaling pathway 
Susceptibility to avirulent 
Pseudomonas syringae strains, 
convergence of LZ-NBS-LRR R gene 
signaling 

Identification of spontaneous 
lesion formation. Wild-type alleles 
are involved in limiting initiation or 
spreading of celi death 

Same as Isds 


(Parker et a/., 1996) 

(Century et aL 9 1995) 

(Dietrich et aL, 1994) 

(Greenberg and 
Ausubel, 1993) 


cims/cpr (constitutive 
immunity/ 
constitutive PR gene 
expression) 

dndl (defense, no death) 
pad (phytoalexin deficient) 

SA accumulation 


Marker gene overexpression (PR-1 or 
PR-2); Role in SA biosynthesis or 
SAR upregulation 

Absence of HR when inoculated with 
avirulent Pseudomonas syringae, 
constitutive immunity 
No phytoalexin accumulation after 
infection by the moderate virulent 
pathogen Pseudomonas syringae pv. 
maculicola ES4326, Genes may be 
involved in phytoalexin biosynthesis 
or in general pathogen recognition 
and signaling 


(Bowling et al> 1994) 

(Yu etaU 1998) 

(Glazebrook and 
Ausubel, 1994) 


niml/nprl/sail (no 
immunity/no PR genes/SA 
insensitive) 

PR gene expression 


Susceptibility to virulent Peronospora 
parasitica isolates after chemical 
immunization, hypersensitive to 
Pseudomonas syringae, counter 
selection using a SA-inducible 
promoter; Niml seems to be a central 
component of SAR 


(Caoe/a/., 1994; 
Delaneye/a/., 1995; 
Shah etaL, 1997) 


non-SAR mutants 
edrl (enhanced disease 
resistance) 


Resistance to virulent Pseudomonas 
syringae pathovars; also resistant to 
Erisyphe cichoracearum 


(Fryeand Innes, 1998) 
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SAR can be distinguished from other inducible disease resistance responses by a 
number of associated biochemical and physiological changes, which eventually confer an 
increased resistance to a secondary pathogen attack. 

SAR Marker Proteins: 

The identification of SAR specific marker proteins was crucial to distinguish SAR from 
other inducible plant resistance responses and to allow its dissection. SAR markers are 
proteins whose expression is tightly correlated with the onset of SAR in uninfected tissue 
(Metraux et aL, 1989; Uknes et al., 1992; Ward et aL 9 1991). All of the SAR markers fall in 
the class of PR proteins, which were originally identified as low-molecular weight, acidic 
proteins accumulating after TMV infection of tobacco leaves (Van Loon, 1985) or other 
pathological and stress-related situations (Kombrink and Somssich, 1997). In tobacco, the set 
of SAR markers is encoded by at least nine gene families (Alexander et aL, 1992; Ward et al. y 
1991). In Arabidopsis, three SAR markers have been identified, called PR-1, PR-2 and PR-5 
(Uknes et al. f 1992), with PR-1 being the most specific marker protein. The Arabidopsis PR- 
1 promoter has been examined in detail for responsiveness to salicylic acid (S A) and 2,6- 
dichloro-isonicotinic acid (INA). Only two or three active promoter domains have been 
identified, none of them with homology to ethylene response binding protein sites, which may 
reflect on a molecular level the observed overall specificity of PR-1 induction (Lebel et aL, 
1998). 

SAR proteins accumulate to high levels during the maintenance phase of systemic 
acquired resistance, suggesting that these proteins contribute to resistance. In support of this 
idea, in vitro antimicrobial activity has been described for tobacco PR-la and other PR 
proteins. In vivo studies involving overexpression of PR- la in tobacco have demonstrated a 
significant increase in resistance to infection by the two oomycete pathogens, P. tabacina and 
P. parasitica var. nicotianae (Alexander et aL, 1993), This result, and similar findings with 
other SAR proteins, suggests that these proteins are causally associated with disease 
resistance. While the function of the PR-2 and PR-5 proteins can be derived from their 
sequences as p-l,3-glucanases and thaumatin/osmotin-like proteins, respectively, the function 
of the PR-1 protein remains elusive. 
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The Role Of Salicylic Acid In SAR: 

Early studies revealed that exogenous application of salicylic acid (SA) or 
acetylsalicylic acid could induce disease resistance and the expression of PR genes (Van Loon 
and Antoniw, 1982; White, 1979). In successive experiments, it was shown that SA 
accumulation is not only concomitant to but also required for SAR induction. SA is 
synthesized in plant cells from phenylalanine, which is converted to t-cinnamic acid (t-CA) by 
PAL, a key enzyme in the phenylpropanoid pathway. t-CA is converted to SA via benzoic 
acid, presumably in the chloroplast (Yalpani et aL, 1993). Free SA is rapidly conjugated to 
the 0-0-D glycoside (Enyedi et aL, 1992) by an SA-inducible UDP-glucose:SA 3-0- 
glycosyltransferase (Enyedi and Raskin, 1993), Only free SA is active, but its action can be 
mimicked by the functional analogs BTH (Benzothiadiazol, CGA 245704) and INA (2,6- 
dichloro-isonicotinic acid, CGA 41396) (Friedrich et aL, 1996; Gorlach et al 9 1996; Lawton 
et aL, 1996; Vernooij et aL, 1995). Using the Pseudomonas putida NahG gene, which 
encodes for a salicylate hydroxylase, plants unable to accumulate SA were generated (Gaffney 
et aL 9 1993). These plants were also unable to raise an SAR response, and were 
hypersusceptible to many virulent and avirulent pathogens (Delaney et aL, 1994). In grafting 
experiments with Afc/fG-expressing tobacco plants (NahG), it has been shown that SAR 
induction does not occur in a NahG scion grafted on a wild-type rootstock infected with TMV. 
But the signal can be produced and transmitted from a 7Va#G-expressing rootstock after 
infection with TMV to a wild-type scion as determined by SAR activation (Vernooij et aL, 
1994). These results confirmed previous kinetic studies in cucumber that indicated SA is not 
the signal responsible for systemic induction of SAR (Rasmussen et al. 9 1991). Furthermore, 
Willits et aL (Willits and Ryals, 1998) showed that, in tobacco, resistance and SAR gene 
expression are linearly correlated to the SA content. This finding negates the argument that 
residual SA in NahG plants might be sufficient to activate SAR and leads to a model in which 
SA is required for defense gene expression in systemic tissue downstream of the systemic 
signal. 

In crosses of the NahG plant to lesion mimic mutants, two different classes of Isd 
mutants were identified. In the mutants Isdl and lsd4 9 lesion formation in these two mutants in 
a NahG background is unchanged, but the activation of SAR requires previous accumulation 
of SA (Hunt et aL y 1997). In the second class of lesion mimics, comprised of mutants Isdl, 



WO 00/65039 



PCT/US00/11460 



lsd6 and lsd7, cell death (and SAR gene expression) is suppressed in a NahG background but 
can be reinduced by exogenous application of DMA or BTH, thus positioning cell death and 
resistance both upstream and downstream of SA (Weymann et al. y 1995; R. Dietrich and J. 
Dangl, pers. comm.). HR and SAR induction appear to be tightly intertwined. 
Other pathogen-derived signals, such as cell-wall fragments, elicitors and wounding induce 
other systemic defense responses in the plant. 

Other Inducible Defense Responses And Their Relationship To SAR: 
The wound-inducible, jasmonic acid (J A) and ethylene mediated defense response has 
primarily been studied in the context of induced resistance to insect predation in tomato and 
tobacco, and more recently, also in Arabidopsis (McConn et aL y 1997). It is triggered by 
wounding and insect feeding and results in the induction of proteinase inhibitor (PI) genes 
(Creelman and Mullet, 1997; Ryan, 1990). The proteinase inhibitors interfere with digestion 
in the insect gut and discourage further feeding. Evidence is accumulating that the wound 
response signaling cascade may also play a role in defense against certain fungal pathogens 
(Staswick et aL, 1998; Vijayan et aL, 1998). It has been known for some time that the 
response can also be triggered by fungal elicitors (Boiler, 1991). In addition, the signaling 
molecules jasmonate and ethylene regulate not only wound-response genes, but are also 
involved in regulating expression of some (non-SAR) PR genes (Reymond and Farmer, 1998). 
They include genes for PR-3 (chitinase), PR-4 (thaumatin-like protein; Lawton et aL, 1994), 
defensins (PDFL2 = PR-13;Penninckx£tf a/., 1996; Penninckx et al 9 1998) and thionins (= 
PR-12; Vignutelli et al. 9 1998) in Arabidopsis. Defensins and thionins are small, cystein-rich 
peptides (~5 kDa) with potent in vitro activity inhibiting fungal growth (Bohlmann, 1994; 
Broekaert et aL, 1995). Structurally related peptides can be found not only in other plant 
species, but also in insects where they are also likely to participate in antimicrobial host 
defenses (Hancock et aL, 1995). Overexpression of defensins or thionins in plants leads to 
enhanced resistance to certain pathogens that are not affected by SAR (Epple et aL, 1997; 
Terras etal 9 1995). 

A genetic block in the wound-response signaling cascade renders plants more 
susceptible to necrotrophic fungal pathogens. For example, the jasmonate insensitive mutant 
jarl and the jasmonate-deficient triple fad3-2fad7~2fad8 mutant are both more susceptible 
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than wild-type plants to Botrytis cinerea or Pythium irregulare but not to "classical" SAR 
pathogens such as Peronospora (Staswick et aL, 1998; Thomma et aL, 1998; Vijayan et aL, 
1998). Similarly, mutants in the ethylene perception (ein and etr in Arabidopsis, ectopic etr in 
tobacco) show increased susceptibility (or increased tolerance) to avirulent and virulent 
pathogens (Bent et aL, 1992; Knoester et aL, 1998; Lund et aL, 1998). 

Furthermore, expression of an antisense construct of a lipoxygenase gene in tobacco results in 
reduced jasmonate synthesis and increased susceptibility to Rhizoctonia solani and 
Phytophtora parasitica (Ranee et aL, 1998). Thus, some components of the wound response 
maybe involved in resistance to certain pathogens as well as in resistance to insects. The 
pathogen spectrum might however vary from plant to plant and may overlap with the pathogen 
spectrum of SAR. 

These two induced "immune responses" seem to make the major contribution to the 
overall systemic response. However, other induced resistance pathways have been described. 
Induced Systemic Resistance (ISR) can be triggered in certain hosts, including Arabidopsis, 
by the biocontrol bacteria Pseudomonas fluorescens WCS 417, by Serratia macescens or by 
cell wall preparations of these microorganisms (i.e. lipopolysaccharides) (Pieterse et aL, 1996; 
Van Wees et aL, 1997). ISR confers quantitative resistance to fungal (Fusarium oxysporium) 
and bacterial pathogens (Pseudomonas syringae) and appears to be independent of SA, but 
dependent on ethylene and jasmonic acid (Pieterse et aL, 1998). Interestingly, none of the 
typical marker genes for JA/ethylene induced resistance is expressed during ISR. 

Similarly, in the edrl mutant (enhanced disease resistance), none of the known marker 
genes is induced while the mutant exhibits a good resistance to Erysiphe pathovars, but not to 
Peronospora (Frye and bines, 1998). 

Finally, in tobacco, the bacteria Erwinia carotovora induces a systemic resistance that is 
antagonistic to the induction of SAR. This resistance can be followed by the transcription 
level of basic PR genes (Vidal et aL, 1997), The pathogen specificities and interactions 
between the different pathways are just emerging (for review, see Maleck and Dietrich, 1999). 

Genetic Dissection Of SAR: 

To further dissect the signaling downstream of cell death, a screen for mutants unable to 
activate SAR after INA treatment was performed. Six alleles of a mutant hypersensitive to 
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Peronospora were isolated (Delaney et aL, 1995), In several other screens, including a marker 
gene based screen using aPi?-2/GUS reporter gene construct, the same gene was isolated, thus 
underlining the central importance of the NIM1/NPR1 (no immunity, no PR gene expression) 
gene for S AR activation downstream of SA (Cao et aL y 1 994). The niml/nprl gene was 
cloned by map-based cloning. The sequence has homologies to some ankyrin-containing 
transcription factor regulators, such as IkBoc (Cao et aL> 1997; Ryals et aL, 1997). 
Overexpression of the NIM1 gene results in plants that are poised to respond faster and 
stronger than wild-type plants after (subclinical) concentrations of chemical inducers (Cao et 
al, 1998; U.S. Patent No. 6,031,153). Interestingly, NIM1 is also required for the SA- 
independent ISR (Pieterse et aL> 1998). Furthermore, niml/nprl does not entirely block the 
resistance observed in some of the cpr mutants (Bowling et aL, 1997; Clarke et aL, 1998). 
For instance, in the mutant cpr5> resistance to Peronospora parasitica is «z>w/ -independent, 
whereas resistance to Pseudomonas syringae is abolished in the niml/nprl background 
(Bowling et al., 1997). Clearly, SAR is part of a complex signaling network and the 
expression of SAR in the plant is strictly regulated. 

Arabidopsis thaliana Is A Model System To Study SAR: 

Because the study of plant-pathogen interactions involves two living organisms, the 
choice of a model organism is crucial to reduce the complexity of the investigation. Tobacco 
was among the first plants in which a systemic activation of defense is observed. In 1961, 
Ross used the tobacco mosaic virus as both the inducing and the challenging pathogen to 
define SAR and Local Acquired Resistance (LAR) (Ross, 1961a; Ross, 1961b). Tobacco, 
because of its amenability to biochemical analysis and its facile transformability, remains an 
important system for the study of SAR. A wealth of pathogens, as well as elicitors that induce 
SAR in tobacco have been identified (Keller et aL, 1996). 

Tobacco is however not easily amenable to genetic studies for gaining molecular 
insights into the components of the SAR signaling cascade. For this approach, research has 
focused on the "useful weed" Arabidopsis thaliana (L.) Heynh. (Meyerowitz, 1989). 
Arabidopsis is attractive as a research tool because of its diminutive stature, short generation 
time (6 to 8 weeks), high seed yield and its small, well-characterized genome, which makes it 
of great use in the dissection of other signal transduction pathways, such as the ethylene or the 
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ABA signal transduction. In addition, members of several important plant pathogens are 
virulent on Arabidopsis (Meyerowitz and Somerville, 1994). Phylogenetically Arabidopsis 
belongs to the Brassicaceae family* which encompasses many crop plants, such as cabbage 
and mustard (Price et aL 9 1994). 

Molecular Tools Li Arabidopsis 

Arabidopsis thaliana has a small genome (110 Mb) with a high gene density (about 1 
gene per 4 kb, or an estimated 21,000 - 25,000 genes in total). Less than 10% of the genome, 
including centromeric and telomeric repeats, contains dispersed repetitive elements (Goodman 
et al. 9 1995; Pruitt and Meyerowitz, 1986). These characteristics make Arabidopsis an ideal 
plant for genetic and physical mapping projects. A large number of genetic markers (Table 2) 
and genetically diverse land races exist. A significant fraction of the genome has been 
assembled into physical contigs in high capacity cloning vectors, such as YACs, BACs and PI 
clones (Schmidt, 1998). A multinational sequencing effort is expected to finish the 
deciphering of all nucleotides of the plant's genome by the end of the year 2001 (Ecker, 
1998). Already an extensive amount of sequence information is available in the form of 
genomic sequence, cDNA sequence or as expressed sequence tags (est). A consortium of 
several French universities, the University of Minnesota and Michigan State University 
(MSU) has accumulated a public collection currently containing over 37,000 clones (Dec. 31 
1998), which are derived from estimated 14,000 distinct genes (Delseny et al. 9 1997; Newman 
et al y 1994). These resources, along with the ease of mutagenesis, transformation, and tagged 
mutant collections, have made Arabidopsis the model plant for functional genomics (Stiekema 
andPereira, 1998). 
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Table 2: Genetic Marker Types Currently Used In Arabidopsis Genetic Mapping Projects 
(partly derived from Rafalski et al. 9 1996, p.72). 



RFLP RAPP SSLP CAPS AFLP Heteroduplex 



Principle 


Genomic 


PCR 


PCR of 


Digest of 


PCR of a 


Altered gel 




rvsj a 


witn 


simple 


PCR 


subset of 


migration of 




digested, 


random 


sequence 


products 


restriction 


he tero duplex 




blotted and 


primers 


repeat 




fragments 


PCR fragment 




hybridized 




regions 




from 












extended 














adanter 














primers 




Nature of 


Single base 


Single 


Repeat 


Single base 


Single 


Single base 


polymorphism 


pair, 


base pair, 


length 


pair, 


base pair, 


nair insertion 




insertion, 


insertion, 


changes 


insertion, 


insertion, 


deletion 




deletion 


deletion 




deletion 


deletion 




Abundance in 


High 


Very 


Medium 


High 


High 


Very high 


the genome 




high 








TVHT A 


No 


No 


Yes 


Yes 


No 


Yes 


sequence 














information 














Dominance 


codominant 


dominant 


codominant 


codominant 


mixed 


mixed 


DNA required 


2 - 10 mg 


10-25ng 


50-100 ng 


50-100 ng 


1 -2mg 


10-100ng 


Available 


>550 


N/A 1 


50 


120 


395 


19 


markers 
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The term "functional genomics" describes the combined efforts to elucidate the functions of 
the increasing numbers of unknown genes identified by mass-sequencing. Currently, the 
functions of roughly 50% of all putative genes are unknown (Bevan et aL, 1998). Homology 
searches, expression profiling, knock-out mutant analysis, overexpression studies, and 
protein-protein interaction analysis might all yield clues to the biochemical, cellular, adaptive 
or developmental role of a given protein (Bouchez and Hofte, 1 998). To match the high- 
throughput sequencing effort, highly paralleled technologies have been conceived, mostly to 
gather gene expression patterns (Table 3). Considering the importance of transcriptional 
changes during plant defense (for example induction of genes such as 4-coumarate;CoA ligase 
(4CL\ phenylalanine ammonia lyase (PAL), and pathogenesis related (PR) genes; Hahlbrock 
and Scheel, 1989), gene expression profiles potentially play an important role in unraveling 
the complexity of regulatory pathways in plant-pathogen interactions. Addition references of 
note include Cho, et aL (1998a); Cho, et. al. (1998b); DeRisi, et aL (1997); Eisen, et aL 

(1998) ; Fambrough, et aL (1999) ; Gerhold, etaL (1999); Iyer, et aL (1999); Kehoe, et 
a/.(1999); Lemieux, etaL (1998); Roth, etaL (1998); Schena, et aL (1998); Tavozoie, et aL 

(1999) ; Tomayo, et aL (1999); and Winzeler, et aL (1999). 
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Table 3: Survey Of Technologies Of Mass-Gene Expression Profiling 



Technology 



Principle 



Comments 



References 



SAGE (serial 
analysis of gene 
expression) 



DNA 

oligonucleotide 
array (chip) 



DNA microarray 



cDNA AFLP 



Differential 
display, PCR 
select 



Short cDNA sequence 
tags (9-11 bp) are 
concatenated and 
sequenced 



For every gene, several 
short, complementary 
oligonucleotides 
(25mers) are 
photolithographically 
synthesized on a glass 
support. Hybridization 
with fluorescently 
labeled cDNA 
populations 

cDNA or est fragments 
are bound to a support 
(glass or membrane). 
Either competitive or 
simple hybridization of 
labeled cDNA 
populations is 
measured 

gel-based comparison 
of two populations of 
cDNAs, labeled by the 
AFLP technology. 

PCR-based 
normalization and 
enrichment of 
differentially abundant 
RNA molecules, 
yielding a cDNA 
librarv 



Rarer transcripts 
may be missed, the 
genome sequence 
(or exhaustive est 
sequences) should 
be available to 
identify the tags. 
Not convenient for 
mass comparisons 
To chose the best 
oligonucleotides for 
every gene, the 
entire genome 
sequence must be 
known. The nec plus 
ultra in 

mmiaturization 
(4,000,000 probes 
per 1.28 cm 2 ) 

Many different 
formats are 
available. Only 
known genes can be 
monitored, rare 
transcripts are likely 
to be missed. 

Labor-intensive 
identification of 
bands by excision 
and cloning, not 
automatable 
Leads primarily to 
the identification of 
differentially 
expressed genes, 
successive 
expression analysis 
still to be done. Not 
high-throughput 



(Velculescu etat., 1997) 



(Wodicka etaU 1997) 



(Ruan et al y 1998; Schena 
et al, 1995; Desprez et aL, 
1998) 



(Bachem et al 9 1996) 



(Diatchenko et al, 1996) 
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Inducible Gene Expression: 
A principal advantage to be realized through genetic engineering of plants is the 
controlled expression of chimeric genes so that they are expressed only at the appropriate 
time, to the appropriate extent, and in some situations in particular parts of the plant. For 
example, the energy expended by a plant to continuously produce high levels of a foreign 
protein could prove detrimental to the plant, whereas if the gene were expressed only when 
desired, the drain on energy and therefore yield could be reduced* Additionally, the phenotype 
expressed by the chimeric gene could result in adverse effects to the plant if expressed at 
inappropriate times during development. For tissue in culture or in a bioreactor, the untimely 
production of a desired secondary product could lead to a decrease in the growth rate of the 
culture, resulting in a decrease in the yield of the product. Thus, specific regulation of plant 
gene expression by exogenous application of chemicals to increase or decrease expression of 
transgenes of interest could be of particularly great commercial value to both seed and crop 
protection businesses as well as to end users (e.g. food processors) of agricultural 
commodities. Promoters activated by specific stimuli can be used for regulated expression of 
value added traits, input traits and output traits as well as for production of certain proteins 
(e.g. antibodies, etc). 

In view of such considerations, it is apparent that control of the time, extent, and/or site 
of expression of chimeric genes in plants or plant tissues would be highly desirable. An ideal 
situation would be the at-will regulation of expression of an engineered trait via a regulating 
intermediate that could be easily applied to field crops, ornamental shrubs, bioreactors, etc. 

Several plant genes are known to be induced by various internal and external factors 
including plant hormones, heat shock, chemicals, pathogens, lack of oxygen, and light. For 
example, as discussed above, exogenous application of SA induces SAR and expression of PR 
genes (Ward, et al 1991; Uknes, et ah, 1992) as well as of synthetic compounds such as 2,6- 
dichloroisonicotinic acid (INA) (Vernooij, etaL, 1995) andbenzo(l,2,3)thiadiazole-7- 
carbothioic acid S-methyl ester (BTH) (Friedrich, et al 9 1996; Lawton, et al 9 1996). 
Therefore, induction of PR protein genes by chemicals or pathogens provides a foundation to 
address the problem of controlling gene expression in plants and plant tissue. 

Studies of the effects of chemical regulators on the promoters of several PR genes have 
been described in the literature, shedding additional light of the SAR signal transduction 
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pathway. Deletion analysis of the tobacco PR-la promoter revealed that a 600-bp long 
promoter lost its functional inducibility by pathogen infection as well by exogenous chemical 
application while 661 -bp retained inducibility although to a lesser extent than a 903-bp long 
fragment (Uknes, et aL, 1993). Analysis of the tobacco PR-2d promoter revealed that some of 
its inducility is lost in a 607-bp long fragment but that 1047-bp are required for maximal 
induction, while a 321 -bp long promoter had lost almost any inducibility (Hennig, et aL 
1 993). Also, a Myb-like transcrition factor (mybl) was isolated and its expression shown to 
be inducible by SA and tobacco mosaic virus (Yang, et aL 1996). Furthermore, it was shown 
to bind in vitro to a fragment of the tobacco PR- la promoter (positions -679 to -487 from the 
transcription start site) containing a Myb-like recognition site (positions -520 to -514). 
Moreover, a sequence in the tobacco PR-2d promoter (-348 to -324) was shown to bind in 
vitro to another protein. Mutation of this sequence reduced inducibility by SA by 
approximately 3-fold but not completely compared to wild-type sequence when included in a 
fragment spanning positions -364 to -288 in the PR-2d promoter and fused to the -90 35S 
CaMV promoter (Shah, et aL 1996). 

United States Patent No, 5,614,395 describes the Arabidopsis PR-1 protein gene and its 
chemically inducible promoter. As described in this patent, the full-length Arabidopsis PR-1 
promoter fragment was fused to the firefly luciferase (LUC) gene and ultimately cloned into 
plasmid pAtPRl-S, which is in turn transformed into Arabidopsis plants for chemical 
induction analysis. The transgenic Arabidopsis lines carrying the PR-1 promoter/LUC gene 
fusion are then treated by spraying with isonicotinic acid (DSTA). When analyzed, the 
transgenic lines showed significantly higher induction of luciferase activity compared to 
water-treated controls. Thus, INA was shown to induce expression in transformed plants of a 
chimeric gene comprising the full-length Arabidopsis PR-1 promoter fragment. Subsequently, 
WO 98/03536 described deletion mutants of \he Arabidopsis PR-1 promoter that are shorter 
than the full-length Arabidopsis PR-1 promoter sequence, yet still yield similar induction of 
gene expression upon the application of a chemical regulator. 

Despite the above successes, previous methods for identifying regulatory elements in 
genes have largely relied upon extensive molecular manipulations commonly referred to as 
'promoter bashing' that involve deletion and/or replacement of DNA sequences in the 5' 
regulatory region of the gene, ligation of the modified promoter sequence to a reporter gene 
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(e.g. CAT, GUS, luciferase, GFP), transformation of plants and evaluation of effects of the 
modification on gene expression by monitoring the activity of the reporter gene in the 
transgenic plants. Such work can be time consuming, and typically only a few genes can be 
evaluated at a time. 

In view of the above, there is a long-felt but unfulfilled need for new methods of 
discovering chemically inducible regulatory elements, particularly chemically inducible 
regulatory elements in plants. 

SUMMARY OF THE INVENTION 

The present invention addresses the aforementioned needs by providing a new method 
whereby the entire genome (transcriptome) is simultaneously surveyed for genes that change 
in expression in response to biotic and abiotic factors. By comparing gene expression changes 
across various treatments, groups of co-regulated genes (regulons) are identified and the 
genomic sequences of genes within a regulon are examined to identify common sequence 
motifs that are likely to act as regulatory elements. These regulatory elements are then used to 
make promoters that drive controlled gene expression. This approach uses experimental 
design based upon the biology of the study system in combination with bioinformatics to 
analyze the results. 

In a preferred embodiment, gene expression profiling using DNA microarrays is used to 
study the transcriptome of a plant to identify groups of genes that show a similar pattern of 
expression in response to biotic and abiotic stimuli, especially biotic and abiotic inducers of 
S AR. Expression profile data can uncover gene groups that are co-regulated (regulons), and 
can be used to identify common DNA sequences that function to specifically regulate gene 
expression in response to exogenous factors but not endogenous signals. The resulting 
regulatory sequence elements can be cloned and used to precisely regulate genes of interest in 
transgenic plants. 

The present invention is useful for identifying genes that are responsive to BTH and/or 
pathogens, using PCR select and microarray gene chip technology. Experiments are 
conducted to compare expression profiles in response to biotic and abiotic inducers of SAR 
and to assess the requirement for salicylic acid and the NIM1 gene for mRNA accumulation. 
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By analysis of data generated with cDNA microarrays, sets of genes that are responsive 
specifically to exogenous application of BTH, a chemical that can activate the SAR response, 
can be identified. That there exists a set of BTH-inducible genes that are not responsive to SA 
or pathogens is especially surprising given the expectation from previous studies of the SAR 
signal transduction pathway that BTH would always act as a functional analog of SA and 
would also activate the SAR response in the same manner as pathogen infection. 

In one example, global gene expression changes in Arabidopsis are monitored under 
different SAR inducing or SAR-repressing conditions using a DNA microarray with 10,000 
ESTs. 660 ESTs differentially expressed under two or more of these conditions are chosen to 
derive groups of genes with common regulation patterns, or regulons. The regulon containing 
the PR1 gene, a reliable marker for SAR induction in Arabidopsis, contains other PR-genes 
and also defines additional novel genes likely to function during SAR and pathogen response. 
A common promoter element is identified in genes of the PR1 regulon. This element is bound 
by members of a plant-specific transcription factor family. Our results confirm expression 
profiling as a means to both define regulation networks and accelerate pathway-specific gene 
discovery in plants. 

Hence, in one embodiment, the present invention is directed to a method for isolating a 
regulatory DNA sequence from a differentially expressed gene, comprising: 

(a) obtaining an expression profile of a genome under two or more different 
conditions; 

(b) comparing gene induction and repression patterns among each of the different 
conditions; 

(c) identifying a gene that is differentially expressed among the different conditions; 

(d) obtaining the sequence of genomic DNA comprising the gene identified as being 
differentially expressed and regulatory elements associated with said gene; and 

(e) isolating a regulatory DNA sequence associated with the differentially expressed 
gene from the genomic DNA. 

In another embodiment, the present invention is directed to a method for isolating a 
common regulatory DNA sequence from a group of co-regulated genes, comprising: 

(a) obtaining an expression profile of a genome under two or more different 
conditions; 
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(b) comparing gene induction and repression patterns among each of the different 
conditions; 

(c) identifying a group of genes that are co-regulated with respect to each other but 
that are differentially expressed with respect to the rest of the genome among the different 
conditions; 

(d) obtaining the sequence of genomic DNA comprising the genes identified as being 
differentially expressed and regulatory elements associated with said genes; 

(e) identifying common regulatory DNA sequences among the genomic DNA 
associated with the co-regulated genes; and 

(f) isolating a common regulatory DNA sequence associated with the differentially 
expressed genes from the genomic DNA. 

In a preferred embodiment, said regulatory DNA sequence is a promoter. In another 
preferred embodiment, said genome is a plant genome. In yet another preferred embodiment, 
said expression profile is obtained using a DNA microarray. In one preferred embodiment, 
said two or more different conditions comprise biotic stimuli. In another preferred 
embodiment, said two or more different conditions comprise abiotic stimuli. In yet another 
preferred embodiment, said two or more different conditions comprise biotic and abiotic 
stimuli. 

According to a preferred embodiment, said genome is a plant genome and wherein said 
two or more different conditions comprise at least one SAR inducing condition or at least one 
SAR repressing condition. Preferably, said least one SAR inducing condition comprises 
pathogen infection, SA application, BTH application, NIM1 gene expression, or a cim 
mutation, and preferably said least one SAR repressing condition comprises NahG expression 
or a niml mutation. 

In an additional embodiment, the present invention is directed to a method for isolating 
a regulatory DNA sequence from a differentially expressed plant gene, comprising: 

(a) using DNA microarray technology to obtain an expression profile of a plant 
genome under two or more different conditions, wherein said conditions comprise at least one 
SAR-inducing or SAR-repressing condition; 

(b) comparing gene induction and repression patterns among each of the different 
conditions; 
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(c) identifying a gene that is differentially expressed among the different conditions; 

(d) obtaining the sequence of genomic DNA comprising the gene identified as being 
differentially expressed and regulatory elements associated with said gene; and 

(e) isolating a regulatory DNA sequence associated with the differentially expressed 
gene from the genomic DNA. 

In still another embodiment, the present invention is directed to a method for isolating a 
common regulatory DNA sequence from a group of co-regulated plant genes, comprising: 

(a) using DNA microarray technology to obtain an expression profile of a genome 
under two or more different conditions, wherein said conditions comprise at least one SAR- 
inducing or SAR-repressing condition; 

(b) comparing gene induction and repression patterns among each of the different 
conditions; 

(c) identifying a group of genes that are co-regulated with respect to each other but 
that are differentially expressed with respect to the rest of the genome among the different 
conditions; 

(d) obtaining the sequence of genomic DNA comprising the genes identified as being 
differentially expressed and regulatory elements associated with said genes; 

(e) identifying common regulatory DNA sequences among the genomic DNA 
associated with the co-regulated genes; and 

(f) isolating a common regulatory DNA sequence associated with the differentially 
expressed genes from the genomic DNA. 

Preferably, said regulatory DNA sequence is a promoter. Also, preferably, said least 
one SAR inducing condition comprises pathogen infection, SA application, BTH application, 
NIM1 gene expression, or a aim mutation, and said least one SAR repressing condition 
comprises NahG expression or a niml mutation. In an especially preferred embodiment, said 
two or more different conditions comprise BTH application, and at least one condition 
selected from the group consisting of pathogen infection, SA application, and a niml 
mutation. In a most preferred embodiment, said two or more different conditions comprise 
BTH application, pathogen infection, and SA application, and wherein one or more genes are 
identified that are inducible by BTH application but not by pathogen infection or SA 
application. In another especially preferred embodiment, said two or more different 
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conditions comprise BTH application and a niml mutation, and wherein one or more genes 
are identified that are inducible by BTH application in niml mutant plants. It is desirable in 
the above embodiments that said one or more genes are inducible at least 5x by BTH 
application. 



DETAILED DESCRIPTION OF THE INVENTION 



Understanding the regulation of plant defense responses is a central goal of current plant 
molecular biology. Research has mainly focused on the initial recognition of the pathogen by 
the plant and on the identification of effector molecules that may be responsible for plant 
defense. This led to the concepts of race-specific resistance and disease resistance marker 
genes. Many marker genes are classified as pathogenesis related (PR) genes that are induced 
following attack by a wide range of pathogens. A subclass of PR genes, termed SAR genes, is 
induced concomitant with the onset of SAR. The expression of the SAR gene PR-1 in 
Arabidopsis is directly correlated with the S A content in the plant making PR- J expression an 
appropriate marker for genetic SAR induction. The recent functional and structural analysis 
of the Arabidopsis PR-1 promoter revealed several INA-responsive elements, as well as 
putative cis elements not involved in the SAR response (Lebel et aL> 1998). Thus PR-1 
transcription may not be strictly limited to defense responses of plants to pathogen attack, and 
in fact PR-1 expression has been observed in other circumstances as well (Uknes et al. y 1993). 
The PR-1 peptide is neither required nor sufficient to confer complete disease resistance. 
Rather, it is thought that the combination of many factors (chitinases, glucanases, 
antimicrobial peptides) together confer the broad-spectrum disease resistance observed during 
SAR. Based on structural similarity to small antimicrobial peptides, PR-1 is likely to be one 
of the factors contributing to this resistance. 

From the structural analysis of several other promoter sequences of Pi? genes, which do 
not necessarily share the same cis elements (reviewed by Rushton and Somssich, 1998; Yang 
et aL, 1997), it can be derived that the signal transduction pathway is diverging from a 
common element in this cascade, salicylic acid. Consequently, a screen based on PR-1 as the 
sole marker can neither identify all members of the signaling cascade nor be totally specific 
for mutations in the SAR signaling cascade. But it can identify some signaling elements that 
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link the pathogen recognition event with the multi-component defense response. The goal of 
this study was to identify an array of different mutants by monitoring for the expression of one 
marker gene. These mutants were subsequently analyzed to further our understanding about 
SAR regulation. 

1 . Principle of the screen for cim mutants 

To isolate new disease resistance mutants we screened for plants that constitutively 
expressed the PR-1 gene. To monitor PR-1 gene expression, we chose the luciferase reporter 
gene because luciferase activity can be monitored in vivo without affecting the integrity of the 
plant. This feature opens up the possibility to rapidly examine many plants as well as to 
reexamine the same tissue several times throughout the experiment (Millar et al y 1992). 
Also, luciferase activity can easily be reexamined in vitro providing a mean for fast 
confirmation and quantification of results obtained by in vivo monitoring. Two lines of 
evidence correlate light emission by Pi?-7/luciferase plants with SAR gene expression: 

• Treatment with chemical activators of SAR gene expression induces luciferase activity 
in vivo and in vitro with kinetics similar to those seen for PR-1 mRNA levels. 

• Infection of PiJ-i/luciferase plants with incompatible pathogens activates luciferase 
activity and PR-1 gene expression in both local and systemic tissue. 

Taken together, these results strongly suggest that in the selected PTJ-7/luciferase line, in 
vivo monitoring of luciferase activity represents a method for detection of the onset of SAR in 
these plants (within the biological specificity of PR-1 gene induction), and that this method 
can be used to identify mutants that display both constitutive SAR gene expression (PR-1, PR- 
J) and enhanced resistance to virulent pathogens. In addition, this reporter line could be used 
for detailed studies of tissue localization and timing of PR-1 gene expression, as this has been 
done before in parsley leaves using in situ hybridization (Reinold and Hahlbrock, 1996). 

The size of the EMS mutant screen was designed to near- saturate the genome with point 
mutations that might lead to constitutive PR-1 gene expression. We estimate that the 
mutation rate corresponded to a 400-fold increase of the natural mutation frequency (which 
has been estimated to 5 x 10" 4 mutations per gene per generation; Feldmann et al, 1994; Li 
and Redei, 1969). In previous screens for SAR mutants based directly on biological tests 
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(Delaney et ah, 1995; Glazebrook et al, 1996) or on the use of other reporter genes (Bowling 
et aL 9 1994; Cao et al y 1994), the Ml and for the most part the M2 populations were much 
smaller. The ease of mutagenesis by EMS is countered by the difficult cloning of the 
underlying (point) mutation, since EMS introduces primarily G-A transitions through O 6 
alkylation of guanine (Britt, 1999). Other commonly used mutagens in Arabidopsis include 
foreign insertion sequences, such as T-DNA or heterologous transposons that have been 
engineered to carry selectable marker genes (for review, see Stiekema and Pereira, 1998). 
While transposons (for example the EnlSpm system; Cardon et al. 9 1 993) move preferentially 
to closely linked locations and consist commonly of a two-component system (transposase 
and mobile element), T-DNA insertions occur randomly throughout the genome and do not 
need to be mobilized (Schulz et aL, 1994). Although only a few mutations per genome are 
introduced by T-DNA mutagenesis (thus increasing the required mutant population size), 
more mutations result in a detectable phenotype than witli EMS mutagenesis. Furthermore, T- 
DNA mutagenesis facilitates the cloning of the mutant gene (especially in cases where the 
phenotype is difficult to determine). We therefore also tried a cim screen with 10,000 T-DNA 
'tagged" mutant lines in the P^-Muciferase background. The frequency of cim mutations 
found in the two screens cannot be compared because the second screen was done in the Tl 
generation. 

Characterization of the different mutants validated the approach taken since all mutants 
with increased luciferase activity exhibited increased resistance to several virulent pathogens. 
The mutants fall into two classes, Isd mutants, the predominant class, and cim mutants. Since 
we were interested in mutants in SAR signal transduction, and not in mutants in which SAR is 
induced as a consequence of spontaneous cell death, we focused the study on cim mutants. 
cim mutants define a diverse group of loci with different disease resistance spectra. 

2. Genetics of cim mutants 

Interestingly, our EMS screen for cim mutants yielded mainly dominant or codominant 
mutations, which rendered genetic analysis, complementation tests, and pathway classification 
by epistasis studies more difficult than with recessive mutants because they require additional 
crosses, more generations and analysis of larger populations. We did, however, map four of 
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the cim mutants, demonstrating that several independent loci have been identified. Several 
hypotheses may be cited to explain the prevalence of (co-) dominant mutants. The SAR 
signaling cascade might be regulated by functionally overlapping signal transduction 
pathways. It has been estimated that 60% of all A. thaliana genes are present in multiple 
copies, based on sequence analysis. Parallel signaling cascades leading to SAR may be 
favored over single-gene regulation to counter the selection pressure exerted by the pathogen 
to overcome the plant defense. Thus, only dominant gain-of-function mutations would be 
phenotypically detectable. Alternatively, SAR signaling might be regulated by a cascade of 
repressors, in analogy to for example the pathway controlling apoptosis in Drosophila and C 
elegans (McCall and Steller, 1997; Vaux and Strasser, 1996). A complex network of 
repressors (bcl-2/ced-9, p35, and others) keep the cell alive and the loss of one of these factors 
is sufficient to induce apoptosis. Similarly, the SAR signaling cascade would be turned on by 
a loss-of-function mutation in a negative regulator, and this mutation could be dominant or 
appear to be dominant as a result of haplo-insufficiency (Melnick et al. y 1993). 

The cim mutations are often not expressed in 100% of the self-progeny. Incomplete 
penetrance has also been found in several hormone mutants (for instance impaired in the 
ethylene or giberellic acid pathways; Kieber, 1997; Ogas et al y 1997) and also in SAR 
mutants (e.g. fcd2, IsdT). To date, no genetic explanation for incomplete penetrance of 
mutations in isogenic backgrounds has been found. Exogenous and endogenous events, such 
as heat, light, and cell homeostasis can be decisive in triggering a labile switch, as known in 
less complex biological systems, such as the phage lambda lysis-lysogeny decision (McAdams 
and Shapiro, 1995; Ptashne, 1992). This bistable behavior is well known in biological 
network theory, and occurs also in higher eukaryotic cells: Feedback regulation can result in 
the presence of two discrete steady-state activities, such that a small stimulus is sufficient for a 
cell to trigger the transition to one state and to stabilize the cell in this state (Bhalla and 
Iyengar, 1999). Such a labile equilibrium would be in accordance with haplo-insufficient 
mutations. The loss of one gene copy of a regulatory factor might still allow proper regulation 
until stress situations titer this factor and the mutation becomes phenotypically evident. 

Interestingly, incomplete penetrance in mixed genetic background is a common feature 
of many mutants in disease resistance, found not only in Arabidopsis, but also in maize. This 
phenomenon may reflect race-specificity observed in many plant-pathogen interactions. Race- 
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specificity may not be limited to R genes, but may extend to parts of the signaling cascade 
leading to systemic resistance. It is known that several marker genes like thionins and 
CXC750 are induced in an ecotype-specific manner (Aufsatz and Grimm, 1994; Epple et aL, 
1998). The incomplete penetrance in mixed backgrounds rendered positional cloning 
difficult. Penetrance of the Cim phenotype in mixed ecotype backgrounds was for all tested 
mutants low (from 5% of the expected ratio to 20% in cimltt for which 1 5% of the F2 
outcross progeny displayed the phenotype instead of 75% expected). Positional cloning was 
further hampered because cimlYl lies in a genomic region with extremely low recombination 
and mutation frequency. It is interesting to note that the NIM1INPR1 gene is situated in a 
similar genomic region (Ryals et aL, 1997). 

There are a number of reasons why the screen did not recover multiple alleles of any 
genes, despite the thorough genome coverage of the EMS screen. The observed incomplete 
penetrance is one possible explanation. Another possible explanation is that different 
mutations in the same gene may cause either lesion mimic or Cim phenotypes. 

3, Nature of cim mutants 

Ten cim mutants were further characterized. While all these mutants were free of HR- 
like lesions, leaf and floral morphologies in some mutants deviated from wild-type. Two 
mutants had a wild-type morphology, suggesting that the morphological changes in the other 
cim mutants were due to mutations independent of the SAR mutation. However, three 
backcrosses did not separate the morphological changes from the SAR inducing mutation, 
Furthermore, the two wild-type looking mutants (cim32$ and cim65S) fell in the class of the 
weakest cim mutants, based on SA accumulation and PR-1 gene expression. It is therefore 
impossible at this point to distinguish between a causally related morphological change (i.e. 
SA as a morphogen) or an organ malformation induced as a consequence of the physiological 
cost of constitutive SAR induction (potential metabolic sinks include PR gene expression and 
SA accumulation). In experiments with repeated applications of chemical SAR inducers, no 
physiological cost of SAR induction was identified. Since these trials were performed on 
mature plants, they do not account for possible nutrition shortage during early development. 
However, metabolic engineering in plants has often shown that plants can produce huge 
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amounts of secondary metabolites or storage polymers without showing energy deficiencies. 
For instance, ectopic expression of a tryptophan decarboxylase in tobacco led to the 
accumulation of 1 milligram tryptamine per gram fresh weight of plants with unchanged 
morphology (Sonstad et ah, 1990). This value can be compared to up to 4 milligram salicylic 
acid per gram fresh weight in the cim mutants, which corresponds to a 15-fold increase to 
uninduced wild-type levels. It should be noted that ectopic expression of tryptophan 
decarboxylase in potato resulted in a severe alteration of the phenylpropanoid pathway. The 
plants were morphologically unchanged, though hypersusceptible to pathogens because of 
depletion of the chorismate pool (Yao et al. 9 1995). Plant metabolism appears to be very 
flexible in reacting to exogenous manipulations, such as the inhibition of amino acid 
biosynthesis (Guyer et al y 1995), or the perturbation of the carbohydrate homeostasis (Stitt et 
aL, 1990). Plants that overexpressed a yeast invertase increased the glycolysis and were able 
to define new "Sink** and "Source'* tissues (Sonnewald et aL y 1991). Similarly, it is likely that 
cim mutants can compensate for the increased energy flux through the SAR pathway by 
enhancing and deviating the metabolism, cim mutants often appear darker green, and might 
have a higher chlorophyll content than wild-type. It is also remarkable in this context that cim 
mutants show enhanced transcription of genes encoding functions in energy metabolism, 
photosynthesis and protein biosynthesis. Similar results in parsely cells also revealed 
extensive changes in metabolism during fungal infection (Batz et al. y 1998). 

Using differential resistance against pathogens, we can assemble a first-order 
classification of the cim mutants. Several mutants were highly resistant to all tested pathogens 
(e.g. cim611, cim%\0). Others were only resistant to fungal pathogens (e.g. cim205). Mutant 
cim695 was resistant to Pseudomonas and Peronospora spp., but not to Erysiphe. Mutant 
c/w328 was strongly resistant only to Erysiphe. The identification of an edr~]\ke mutant such 
as cim32S with weak accumulation of PR gene transcripts was possible because the luciferase 
reporter gene assay is very sensitive. 

It will be interesting to check for resistance to other pathogens, including viral, and root- 
specific {Pythium, nematodes) pathogens. These tests, as well as tests with non-SAR 
pathogens (e. g. Fusarium), are in progress, and will help to further define regulation and 
branch points in the induced plant disease resistance response. 
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As all dm mutants are resistant to several virulent pathogens, the resistance appears JR. 
gene-independent and does not require an HR. However, some cim mutants (for example 
cim695, ciml 1 6) develop a strong HR-like lesion formation in response to Peronospora 
parasitica inoculation. Most cim mutants are able to develop an HR in response to an 
avirulent bacterial pathogen but some appear to simply bypass HR (cww713, cimSlO). 
Arabidopsis mutants with a similar phenotype, called dnd (defense, no death) have recently 
been isolated (Yu et al 9 1998). They exhibit disease resistance to two virulent pathogens and 
do not develop an HR when inoculated with avirulent Pseudomonas strains, although they 
remain fully resistant. These cim and dnd mutants raise questions about the role of cell death 
in defense. Obviously, cell death is not required for an effective, incompatible gene-for-gene 
interaction, or for the induction of SAR in these mutants. HR may be required in wild-type to 
potentiate an SAR-inducing signal, possibly via the release of reactive oxygen species 
(Alvarez*?* al y 1998). 

It is tempting to speculate about the mechanistic nature of broad-spectrum disease 
resistance in the cim mutants. Because of the very different lifestyles of the pathogens used 
(Erysiphe, Peronospora spp., Pseudomonas), it is unlikely that simple host morphological 
changes, for example in the cuticle, are responsible for this resistance. Rather, the cim 
mutations cause a fundamental deregulation of the SAR pathway. In principle, hypermorphic 
gain-of- function mutations or haplo-insufficiencies may be able to activate the SAR pathway 
through cross-talk and side effects without being members of the signaling pathway per se. 
This might explain the different disease resistance spectra of some mutants. However, the 
same two classes of mutation could exist within the R gene-dependent signal cascades, 
although the resistance in cim mutants is not race-specific. A preinduction of the R gene- 
mediated signaling pathway could induce the cascade leading to an activation of SAR, and 
consequently convert a compatible interaction into an incompatible. In principle, even a 
mutation in an R gene could trigger SAR. It has previously been shown that such mutations 
can also cause a lesion mimic phenotype (shown for Rpl in maize (Hu et aL, 1996), and mlo 
in barley (Peierhansei et al> 1997)). 

How far beyond SA and PR transcript accumulation can the comparison between cim 
mutants ("hypochondriacs") and pathogen-infected plants be stretched? Upon infection by 
pathogens, plants undergo fundamental cellular and biochemical changes (Gus-Mayer et aL, 
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1998). Arabidopsis is not easily amenable to study morphological alterations on cellular 
level, such as cytoskeletal rearrangements, or increased ion fluxes through the plasma 
membrane. Measurable parameters include gene expression (PR-1)> accumulation of 
secondary metabolites (SA, camalexin) and apposition of cell wall material (e.g. callose). 
Parallels of the measurable biochemical changes in the cim mutants to pathogen-infected 
plants are restricted to systemic changes induced by pathogens and do not reflect the plant's 
local defense responses. 

The quantitative differences in disease resistance and biochemical markers among the 
cim mutants reveal a complex regulation pattern of the different signaling branches of disease 
resistance responses in Arabidopsis. To assess the phenotypic diversity of the cim mutants at 
molecular level, we utilized a highly paralleled gene expression profiling method, the DNA 
microarray. We monitored gene expression of roughly 5000 different genes in the cim 
mutants and following chemical induction of SAR. While expression of approximately one 
quarter of all Arabidopsis genes was monitored, pathogen-induced genes are likely to be 
underrepresented in the MSU est collection because the cDNA libraries were not derived from 
pathogen-induced tissue. The deciphering of the Arabidopsis genome will allow the 
monitoring of expression of all genes, as has been done for S, cerevisiae (DeRisi et al, 1997). 
In the meantime, ests are the best way to access genes in Arabidopsis. We showed that DNA 
microarray experiments can be compared to each other, using internal standardization. 
Internal control RNA also allowed us to estimate the sensitivity of detection. 0.06 ng spiked 
RNA was detected (at 1000 FSI just above background) which corresponds to a copy number 
of 1 per cell (based on 100,000 mKNA species per cell). The distribution of abundancies of 
mRNA species obtained by the microarray matched well the distribution curve obtained by 
other methods such as Rot kinetics (Kamalay and Goldberg, 1980). In addition, in 
comparisons to Northern blot analysis, we found very similar sensitivities for the DNA 
microarray in detecting differences in gene expression. The similar sensitivity of Northern 
blot analysis and DNA microarray is based on similar hybridization stringencies: The washing 
conditions of Northern blots (65°C, G + C buffer at 71 mM [Naj 9 DNA-RNA interactions) 
and DNA microarrays (30°C, 0.2 x SSC at 39 mM [Na + ], DNA-DNA interaction) have quite 
similar stringencies (but the estimates are hampered by the lack of exact formulas). 
Empirically, it can be assumed that cross-hybridization on the DNA microarray does not occur 
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to a greater extent than during Northern blot analysis, although cross-hybridization has to be 
taken into account for closely related gene family members with both methods (e.g. LTPs). 
The DNA microarray cannot distinguish between related transcripts of different size, whether 
from cross-hybridizing family members of from differential splicing (e.g. XET). Both 
sensitivity and reproducibility of microarrays are not yet matched by the other methods we 
used to assess the molecular differences in SAR induction. Two-dimensional protein gel 
electrophoresis suffers from insufficient resolution and difficulty in peptide identification. 
Metabolite profiling depends largely on the extraction protocol and on the detection method 
and hence detects only subsets of metabolites, for example compartmental or structural. 
Besides these technical limitations, it is currently unknown how many changes in secondary 
metabolites are induced during plant pathogen defense. In a similar study, in barley, only a 
few changes were detected in the biochemical cytosolic and cell wall composition during 
pathogen infection (von Roepenack et al 9 1 998). In conclusion, it is not straightforward at 
this time to match proteins to gene induction, and metabolites to proteins. Hence it is difficult 
to match traits to genes. 

To determine if correlations could be detected between pathogen-specificities and gene 
induction patterns, we chose from our collection three cim mutants that displayed different 
spectra of disease resistance. Despite clear morphological differences among the three cim 
mutants and the BTH control, many genes were altered in similar ways in the four 
experiments, compared to wild-type. Co-regulated sets of genes could be used as more 
reliable indicators of specific induced defense responses than single marker genes. Included 
in this group are the known SAR marker genes PR-1, PR-2, and PR-5> other pathogen- 
inducible genes such as extensins (Memelink et al> 1993) and CXC750 (Aufsatz and Grimm, 
1994), as well as genes encoding proteins with unknown function. 

A breakdown of the 478 elements with altered expression in the cim mutants into gene 
groups based on presumed function, shows that the largest class contains genes with no 
homologies or unclear functions (39%, table below). In the Arabidopsis genome (represented 
here by the ESSAi contig, 1.9 Mb; Bevan and ai., 1998) this number is higher, probably 
because many of the unknown genes are not highly transcribed and therefore not in the est 
collection. Results obtained from the C. elegans sequencing project support this hypothesis: 
conserved genes are more likely to be highly expressed (The C. elegans sequencing 
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consortium, 1998). Noteworthy is the high percentage of induced genes involved in disease 
and defense. 14% of the genes with altered expression in the cim mutants fall into this class 
compared to 7% in the ESSA contig. This is remarkable because the ESSA contig contains an 
MRC and therefore more genes from this class than the genome average (estimated at 2 - 3%; 
Botella et aL, 1997) and because in the est population, pathogen-induced ests are most likely 
underrepresented. For instance, no est corresponding to the PR-1 gene is present in the MSU 
collection. 



Classification of genes with altered expression in the cim mutants and comparison to 
prevalence of these gene classes in the genome (as represented by the ESSA1 contig). 


gene class 


gene expression ( 


%) genome (ESSA contig) (%) 


total number 


100 (n = 478) 


100 (n = 485) 


metabolism 


9.8 


10.4 


energy/ photosynthesis 


4.7 


2.9 


cell growth and division 


1.6 


2.9 


transcription 


5.0 


7.8 


protein synthesis 


4.5 


1.6 


protein transport and storage 


3.4 


2.6 


transporters 


1.6 


0.8 


intracellular traffic 


13 


1.0 


cell structure 


6.1 


3.6 


signal transduction 


6.6 


4.4 


disease/defense 


14.0 


7.3 


secondary metabolism 


1.3 


4.7 


transposons 


0,5 


2.6 


unclear 


22.5 


33.7 


no homology 


16.9 


15.8 



Genes for protein synthesis and energy production are induced disproportionally in the 
cim mutants. Genes for secondary metabolism, induced only under specific conditions, and 
transposons, usually not transcribed, are underrepresented in the collection of induced genes, 
compared to the percentage of genes in the genome. Unfortunately, no classifications for est 
libraries similar to the one shown in the above table have been published. Thus, no 
comparisons can be made between the numbers of genes of various classes in the est 
population in general versus those with altered expression in our mutants, nor can any 
statements be made regarding the representation of the genome in the est collection. 

The prevalence of common features in gene expression among the mutants lead us to the 
hypothesis that pathogen specificity in (genetically) induced SAR originates in subtle 
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differences of gene expression rather than the presence of several independent signaling 
pathways represented by distinct sets of marker genes. The high degree of similarity can be 
better expressed by statistical analysis than by gene expression profiles. We applied statistical 
methods normally used to calculate phylogenetic distance among macromolecules to calculate 
the similarities of gene expression patterns in the cim mutants and their crosses to NahG. The 
gene expression patterns in three mutants are indeed very similar, although distinct from each 
other. Mutant cim205 best mimics the chemical SAR induction by BTH. This is for example 
reflected in the suppression of the senescence associated proteins in these two samples that is 
not found in the other two mutants. 

Furthermore, we addressed the question of what genes were regulated in a similar way 
under various conditions. To form these groups of "regulons" (groups of co-regulated genes), 
we used the same calculation methods to find similarities in gene expression patterns under 16 
different conditions, including the 6 different experiments listed in table 9, and 10 others 
(including herbicide treatments and pathogen treatments). The PR-1 gene is almost unique in 
its expression regulation, clustering only with few other genes that match its expression 
pattern under these 16 different conditions. The three copies of PR-5 and of the PRXC gene, 
as well as 20 other ests> are included in this cluster. 

In addition to the identification of common patterns, we were also able to correlate 
specific gene groups to specific resistance phenotypes, but the causality of these correlations 
remains unclear for now. Causalities have in the past been proven to be difficult to establish 
from correlations. For instance, the role of the PR-1 protein in plant pathogen defense is still 
not clear and the function of the phytoalexin camalexin is only starting to be elucidated 
(Glazebrook et al. t 1997). Genetic studies addressing this question are complicated by 
functional overlap of the defense-related molecules and by the fact that quantitative 
differences in gene induction seem to be more important than qualitative differences in 
generating pathogen resistance specificities, as derived from our mutant characterization. 
Using all the cim mutants in our collection in DNA microarray experiments, it may be 
possible to match specific gene inductions to specific resistance spectra. 

By counting the total number of genes with altered expression, we concluded that 
cimlYh was likely to display a more complete resistance than the other two mutants, and this 
was confirmed by pathogen tests. It is, however, not easy to distinguish the influence of weak 
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and strong alleles from hierarchical gene induction effects. Finally, it has to be stressed that 
the regulation of gene transcription represents only one (important) level of control of cell 
activity. In higher eukaryotes, significant regulation also occurs at the translational and 
protein levels (hence the importance of 2D gel electrophoresis). In spite of its limitations, it 
can be anticipated that expression profiles will provide insight into gene expression 
regulation, of how, when, where and why the genes act in concert to regulate complex 
processes, such as pathogen defense, in a whole organism (Ruan et al, 1998). 

4. Epistasis studies and the relationship of dm mutants to salicylic acid 

Along with mutant classifications based on disease resistance spectra and gene 
expression profiles, epistasis studies to known disease resistance regulatory mutants, such as 
ndrl (Century et aU 1995), edsl (Parker et al 9 1996), mdpad4 (Glazebrook et al y 1997) will 
give further insight into the relative relationships among the loci identified by this collection 
of cim mutants. Because of genetic problems encountered during the work with the cim 
mutants, we limited the epistasis analysis to crosses of cim mutants to the dominant NahG 
plants, asking the question whether SA accumulation is a prerequisite of SAR induction in the 
cim mutants. From the study of several Isd mutants in the NahG background, it has been 
concluded that SA accumulation is regulated by a feedback regulation (Weymann, 1995). 
Similarly, biochemical studies have shown that PAL, a key regulator of the SA biosynthesis, 
is itself activated by SA (Mauch-Mani and Slusarenko, 1996). As a consequence, a mutation 
"downstream" of SA accumulation, such as a dominant-negative mutation in the NIM1INPR1 
gene might lead to both a Cim phenotype, and to S A accumulation by feedback activation. 
This could explain why no cim mutants without elevated levels of SA were identified. 
Mutants of this class might still display disease resistance when the SA accumulation is 
depleted by an SA hydroxylase (NahG). 

We in fact found two mutants cim695 and cimlU that exhibited SA-independent 
resistance to Peronospora, although PR-l/gene expression in all cim mutants was strictly 
dependent on SA accumulation. A similar phenotype as the one observed in these two cim 
mutants has been described for cpr5 and cpr6, which retained resistance to P. parasitica in a 
nimllnprl background while resistance to virulent Pseudomonas syringae strains was 
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suppressed (Bowling et al 9 1997; Clarke et al 9 1998). Disease resistance of these mutants in 
a NahG background, however, has not been reported. While cpr5 in a NahG background 
exhibits an extreme dwarf phenotype with black lesions (Bowling et al. 9 1997), mutants 
cim695 and cimlYh retained wild-type morphology in the NahG background. The common 
induction of SA-independent and SA-dependent disease resistance pathways in cim695 and 
cimlYi might illustrate that their signaling cascades share common members downstream or 
independent of cell death. 

Furthermore, cim695 and cimlU, which exhibit an S A-independent resistance, do 
accumulate S A to five fold higher levels than wild type. The resistance conferred by 
mutations cim695 and cimlXl may therefore lie in a feedback loop as suggested above, 
triggering multiple resistance mechanisms including SA-independent resistance pathways 
which lead subsequently to S A accumulation. SA-independent resistance has been described 
in the literature. For instance, a jasmonate-dependent defense response in Arabidopsis has 
been shown to confer resistance to a distinct set of pathogens (Thomma et al> 1998). This 
wound- and necrotrophic-inducible disease resistance is correlated with the expression of the 
PDFL2 gene (Penninckx et al y 1998). 

We used the collection of cim mutants to address whether PDFL 2 gene expression is 
correlated to SA-independent disease resistance, as suggested for other broad-spectrum 
disease resistance mutants (Bowling et aL, 1997). We found mutants that, when crossed into 
a NahG background, expressed the PDFL2 gene at high levels, but lost resistance to P. 
parasitica (e.g. cim205 9 cim1\6). Conversely, ciml\ 3 in a NahG background retained 
resistance to P. parasitica despite a severe reduction in PDFL2 expression. Thus, in our 
mutant collection, there is no consistent correlation between the transcription of the PDFL2 
gene and the activation of an SA-independent signaling pathway that confers resistance to P . 
parasitica. This may imply the presence of a third, SA-independent pathway leading to 
resistance to P. parasitica Noco2, which does not include induction of PDFL2. Induced 
systemic resistance (ISR) has been found to be both independent of SA and not linked to 
PDF1.2 expression (Pieterse et aL, 1998), Because no molecular marker or mutant defines 
ISR to date, it is not possible to compare the SA-independent resistance observed in the cim 
mutants to this phenomenon. 
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The difference in resistance of the "double mutants" is reflected in a similarity tree of 
gene induction in the mutants. The radiation between cimlYi and cimlOS in NahG 
backgrounds is bigger than it is in the mutants alone. But both resemble NahG plants more 
than they resemble the mutants. This addresses the question concerning the role of SA in 
plants. More than 300 genes show altered expression in NahG plants compared to wild-type. 
The removal of S A (and/or the accumulation of catechol) triggers a major switch in gene 
regulation, as does the accumulation of SA. Using the new gene expression profiling 
technology, we were able to visualize this global gene switch. There might be a well-defined 
master gene switch (such as ahomeobox regulation (Gehring, 1987), see Plesch et aL, 1997 
for an example in Arabidopsis) that is naturally induced by salicylic acid accumulation, and 
that can also be triggered by exogenous BTH, Based on this and the pleiotropic effects on 
plant development caused by increased SA accumulation (reminiscent of mutants impaired in 
hormone action; Kauschmann et ah, 1996), the role of S A as a plant hormone may need to be 
revisited. SA, as ethylene and jasmonic acid, might have more functions than the induction of 
disease resistance. The regulation of all three signal molecules is interdependent and the role 
of the two latter molecules in the cim mutants remains to be elucidated. 

5 . The Transcriptome of Arabidopsis During S AR 

To gain a more thorough understanding of the complex regulation of gene expression 
during SAR, and to identify new (marker) genes associated with SAR, we used expression 
profiling (Schena et al. 9 1995; Shalon et aU 1996). PolyA* RNAs obtained from plants 
grown under 18 different conditions that either induce or repress SAR (see table below) were 
used in mixed hybridizations against RNA of untreated wild-type plants on a DNA microarray 
(Ruan, et al, 1998) with 10,000 ESTs, a Unigene set representing roughly one third of all 
Arabidopsis genes. In addition, two treatments which alter plant metabolism, but are not 
related to SAR, were analyzed (see table below). Some samples were taken during the 
induction phase of SAR (4 hours after BTH induction), or from primary, pathogen-infected 
tissue, but most of the samples were derived from (steady state) SAR maintenance phase (e.g. 
48 hours after BTH induction, cim mutants). Together, the experiments gave us L8 x 10 5 
gene expression data points. Under these 18 conditions, however, only 660 different genes 
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displayed significant differential expression compared to wild-type in at least two SAR- 
relevant samples (significance level defined as 2.5 fold differences from wild-type). By 
comparing the fluorescence signal intensities to spiked controls, abundance ofmRNA species 
can be determined . Most of the genes with altered expression fall into the class of low- 
abundance transcripts (1 to 10 copies per cell), while a few were in the medium abundance 
class. Housekeeping genes, with more than 100 copies per cell, did not exhibit altered gene 
expression under SAR. 

Diversity of conditions used to describe the transcriptome of Arabidopsis thaliana 



during SAR. 



Sample (all compared to wild-type tissue) 


Comments 


Adenylosuccinate synthetase antisense 
Hydantocidin (0.9 mM) 


Reference inductions, irrelevant to resistance 


cim6 
cim? 
cimll 


Constitutive SAR mutants 


NIM1 overexpresser 


Primed SAR response 


NahG 

cim6NahG 
cimim&iG 

NahG + Pst DC3000 avrrpml T tissue, 44 hours 


No SA accumulation, no SAR response 


NahG + Pst DC3000 avrrpml V tissue, 44 hours 
BTH 4 hours 
BTH 48 hours 


Primary (partial) LAR induction 
SAR induction 
SAR maintenance 


P.p. EMWA1, 48 hours 

Pst DC3000 avrrpml V tissue, 44 hours 

Pst DC3000 avrrpml T tissue, 44 hours 

nim + Pst DC3000 avrrpml T tissue, 44 hours 

P.p. Noco5, 48 hours 


Compatible (disease causing) interaction 
Incompatible interaction 



First, we compared gene induction patterns of these 660 genes among each of the 
different conditions. Data representing a series of differential gene expression measurements 
was obtained. Analysis of both the regulation of individual genes under varying conditions or 
of the overall similarity of the various conditions based on expression profiles required the 
calculation of distance matrices. These were calculated in S-Plus using the dist function under 
the euclidean metric. Clustering of similarly regulated genes was performed and visualized in 
S-Plus using hierarchical clustering under the compact method with the, functions hclust and 
plclust. Trees used to depict similarity in gene expression patterns among the conditions may 



-33- 



WO 00/65039 

PCT/USOO/11460 

be produced using the fitch and drawtree programs distributed in the Phylip suite (Phylogeny 
Inference Package) version 3.57c. 

Genetic, chemical and biological induction of SAR all caused very similar patterns of 
gene induction. The three aim mutants, which constitutively express SAR, have a pattern of 
transcriptional induction similar to that caused by BTH during SAR. maintenance. 
Interestingly, the overexpression oiNIM, which primes the SAR response, also resembles 
SAR maintenance (U.S. Patent No. 6,031,153; Cao al, 1998; Maleck etal, 1998). 
Compatible and incompatible races of the pathogens Peronospora parasitica and 
Pseudomonas synngae induced many common genes, although the pattern of gene induction 
was not identical to either of the BTH-treated samples. In contrast to the requirement for SA 
accumulation to high levels for SAR induction, many gene inductions are triggered by low 
levels of SA. For example, in NahG-expressing plants that accumulate only low levels of SA 
the expression of more than 300 genes is significantly altered. This emphasizes a general 
regulatory role for SA in the plant cell. 

NahG suppresses SAR gene expression in crosses to two of the SAR-constitutive dm 
mutants, cim6 and ciml 1 , to a baseline resembling that of NahG-expressing plants. In 
addition, NahG expression results in a characteristic gene expression fingerprint in secondary 
ttssue from plants inoculated in primary tissue with avirulent bacteria. This corresponds to the 
inability of these plants to establish SAR. Interestingly, the corresponding primary tissues in 
NahG-expressing plants display changes in gene expression which compares very closely to 
wild-type primary, infected tissue and this sample does not cluster with other NahG samples 
This is consistent with findings from grafting experiments that SA is required for SAR 
development in systemic tissue, but that the systemic signal can be emitted from primary 
infected NahG-expressing tissue (Delaney et al., 1994; Gaffhey et al., 1993; Vemooii et al 
1994). 

We used these expression profiles to identify classes of genes that were consistently 
coregulated, and we sought to identify genes associated with SAR. We used statistical 
Phylogenetic cluster analysis to compare the gene expression profiles among 660 genes under 
all 1 8 conditions. Clusters of genes with similar induction patterns under all conditions were 
grouped. The two clusters most distinct from the PR1 gene induction pattern, with EST 
127P22T7 and EST 163B24T7 as 'type genes," contain genes that are strongly (more than 20 
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fold) suppressed by NahG expression. The cluster containing chalcone synthase (ChS) as the 
type gene includes genes induced during SAR-maintenance, but not during SAR-induction; 
these are weakly repressed by NahG expression. The cluster containing EST 209E19T7 
defines genes that are transcriptionally induced in NahG-expressing plants. The cluster 
containing EST 1 18P18T7 defines genes that are not significantly responsive to SAR- 
inducing conditions like chemical and genetic induction, but do respond to avirulent bacteria 
and are downregulated in NahG expressing plants. Phenylalanine ammonia lyase (PAL) and 
20 other ESTs that cluster together are repressed by NahG expression, but are induced during 
the maintenance phase of SAR, for example in cim mutants or 48 hours after BTH treatment. 
The cluster of "PR1 like" genes exhibits similar induction behavior to genes in, the PAL gene 
cluster but these genes are only weakly suppressed in NahG-expressing plants. 

To address the question of expression induction relatedness among genes within one 
cluster, we analyzed the PR1 cluster in detail. The PR1 regulon contained 25 other ESTs (17 
different genes). These are prime candidates for SAR marker genes and the encoded proteins 
are likely to play a physiological role in SAR. The estimated 1 .5 to 2-fold redundancy of our 
EST set is a good internal control for this analysis and we also included three replicates of the 
PR5 and the PerC cDNAs (as well as 28 other relevant cDNAs) on the DNA-microarray. All 
three copies of the two genes cluster with PR1, showing the robustness of the DNA 
microarray analysis. Similarly, two ESTs that are derived from Asparagine synthetase, two 
from the gene for blue copper binding protein and two from glutathkmine S-transferase (GST) 
cluster with PR1. The standard deviations for differences in expression of the 30 members of 
the PR1 cluster were smaller than one unit of differential gene expression (compared to 2.5 
units cut-off for our definition of significant alterations) under all 18 conditions. Thus, genes 
in one cluster show highly reproducible expression in our analysis. 

To help explain the molecular basis of this co-regulation, we analyzed thepromoters of 
those eight PR1 cluster ESTs encoded within sequenced genomic regions. Strikingly, these 
genes all shared a common TTGAC consensus cis-ekment known to bind proteins of the 
WRKY zinc finger transcription factor binding family, often in multiple copies. In control 
promoters of eight ESTs selected randomly among the 634 non-PRl-like ESTs, the element 
TTGAC was only found in the statistically expected frequency. The WRKY element was not 
present in three of the randomly selected promoters, in three promoters once, in one promoter 
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twice, and in one promoter three times. WRKY elements are sufficient for defense gene 
transcriptional induction in different plant systems (Lebel et al, 1998). A functional analysis 
of the BTH-induction of the PR1 promoter revealed two elements sharing all or four out of 
five consensus WRKY sites at -676 and - 638 (Lebel et al, 1998). The former acts as a 
repressor and the latter as a SA responsive element. It is therefore likely that WRKY 
transcription factors are responsible for the common induction pattern of the eight genes listed 
in Table 1 , and possibly also for the remaining 22 ESTs of the PR1 gene cluster for which no 
genomic sequence is presently available. 

We did not identify another common consensus sequence in the eight PR1 cluster 
promoters analyzed. The NFkB binding site found in the PR1 promoter which is required for 
full BTH-induction is not conserved among the eight promoters within the PR1 regulon. 

In addition to identification of common regulatory elements, cluster analysis of 
expression profiles provides a tool to derive physiological functions of genes. This is 
important for sequences with no close homologs in the databank (for example EST 
134C20T7 or EST 1 92 K7T7) and also for genes with structural similarity to genes with 
known function (such as asparagine synthetase). We anchored the results of our statistical 
analysis first to those genes that were known to be co-regulated with PR1. Interestingly, 
several genes in the PR1 cluster encode proteins involved in redox regulation. Although 
reactive oxygen intermediates do not likely act as secondary messengers in S AR downstream 
of SA accumulation, they are clearly implicated in defense response, either as local second 
messenger molecules, as direct cellular and microbial toxins or in cell-wall cross-linking and 
other oxidative processes (Alvarez et al, 1998). From the several known Arabidopsis 
peroxidases, peroxidase C seems to be specifically induced under plant defense conditions, as 
are some GSTs. Supporting this is the previous demonstration that both peroxidase C and 
GST are transcriptionally activated in a superoxide-dependent manner in the Arabidopsis 
mutant hdl , which lacks the ability to halt hypersensitive-response-like lesions (Jabs et al, 
1996; Dietrich et al, 1994). Likewise, the blue-copper protein might be involved in the 
regulation of redox stages during SAR or might be the chelator of free cellular copper used for 
Cu/Zn SOD. Cu/Zn SOD is suppressed during SAR (Fodor etal, 1997). 

It has been proposed that plant metabolism adapts to pathogen attack (Batz et al, 1998). 
We can now more precisely identify such changes and investigate their importance in disease 
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resistance. For example, it is instructive to speculate about the possible role of the 
coregulation of glutamine-dependent asparagine synthetase with the PR1 cluster during SAR. 
An increased flux of carbon through the phenylpropanoid pathway (lower overall N:C ratio) 
might liberate nitrogen, thus creating a demand for asparagine synthetase as a detoxifying 
enzyme. Similarly, pathogen infection increases the level of free ammonia in the plant, 
following an increased metabolism of proteins (Bate et al, 1 998). Asparagine synthetase can 
also utilize ammonia as a substrate. Hence, our observation that asparagine synthetase is co- 
regulated with PR1 focuses attention on physiological changes during SAR that might not be 
directly linked to defense. Alternatively, the induction of asparagine synthetase might be 
related to the induction of tyrosine transaminase. This is also the first time that the production 
of toxic mustard oils, released by myrosinase from glucosinolates, can be correlated to SAR in 
Arabidopsis. 

Our analysis describes the first map of the plant defense transcriptome during SAR in 
Arabidopsis. Using cluster analysis of DNA microarray data, significant changes in gene 
induction can be differentiated from random correlations. The large number of data points 
filters out coincidental co-induction and enhances the resolution and significance of serial 
analysis of expression profiles. It is clear that the description of complex signaling networks 
can only be obtained by investigating multiple inductive conditions, and not, as before, by 
pair-wise comparisons. Thus, similar analysis of additional mutants and inducing conditions 
will further enlarge the complexity and refine the. resolution of the entire Arabidopsis 
transcriptome. 



6. Conclusion 



We identified novel mutants that mimic the maintenance phase of SAR. Biochemical 
and molecular analysis revealed a high degree of similarities between these mutants and 
chemically SAR-activated plants. The mutants differed however in gene expression 
regulation, metabolite accumulation, genetic map-position and disease resistance spectrum. 
For instance, an SA- and />Z>/7.2-independent pathway leading to resistance to P. parasitica 
has been identified in one mutant, but not in others. Although questions relating SAR kinetics 
(induction phase) cannot be addressed in these mutants, the "frozen" steady-state of SAR (or 
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LAR) presents several experimental advantages that were exploited using three approaches to 
deepen the understanding of SAR: 
• Epistasis studies 

Epistasis studies with dm mutants might help to understand interactions between the 
different induced disease resistance pathways. The epistasis studies to NahG revealed the 
presence of SA-independent disease resistance in two dm mutants. It will be interesting to 
establish the spectrum of pathogens affected by this novel resistance. Furthermore, the 
varying expression of the PDF 1.2 gene in the dm mutants raises the question whether the 
wound-inducible or the ISR pathway are induced or repressed in different dm mutants. 
Mutants impaired in JA- or ethylene-signaling can be crossed to the dm mutants that exhibit 
an SA-independent disease resistance. These double mutants may provide insights into the 
role of these other signaling molecules in the Cim phenotypes and broad-spectrum disease 
resistance. Similarly, the direct biochemical measurement of JA and ethylene inplanta, and a 
more thorough study of JA and ethylene marker genes in different mutants may provide clues 
about pathway cross-talk. 

• Gene expression profiling of SAR 

We established a new technology for the study of plant signal transduction, the DNA 
microarray. Gene expression profiling enabled us to establish correlations between the 
expression of gene groups and SAR phenotypes (dm mutants, NahG). The correlations can in 
the future be refined with more mutants and with the use of more pathogens to better describe 
the differential disease resistance in these mutants. It became evident that gene profiling is 
potentially a powerful tool in understanding metabolic changes, but that it cannot yet (and to 
some extent will probably never) be able to describe how cells (or even organisms) work. 
Technology that needs to be improved includes analysis tools to optimize the data capture and 
mining. For this purpose, we applied general statistical methods to reduce the data complexity 
to manageable summaries. We were able to select co-regulated genes, "regulons" (which 
allow the identification of consensus promoter elements), but more sophisticated analysis 
tools and a higher number of available expression profiles may indeed enable us to visualize 
the transcriptome of a plant or plant tissue under specific conditions. 

The newly obtained gene groups may be useful in developing more precise screens for 
mutants with specific traits and in lead optimization for agrochemicals. In addition, they may 
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be helpfUl in predicting disease resistance spectra in other mutants. Furthermore, gene 
expression profiles may also be helpful in deriving gene functions, because it provides 
information on gene expression of many genes under many conditions. In conclusion, gene 
profiling is a powerful tool in understanding signaling cascades, and their interactions. 

Other questions pertaining to the cell metabolism and its regulation require however the 
use of other techniques. We utilized again the genetic variations in the SAR signaling cascade 
to follow changes in the proteome and the "metabolome" (entirety of the plant's metabolic 
products) under SAR induction. However, the observed changes at transcriptional level were 
not reflected at protein or metabolite level, as detected by 2D gel electrophoresis or HPLC 
separation. Future technical advances will be helpful to study the metabolic changes during 
SAR in a systematic way and may also allow the analysis of protein-protein interactions 
through the large-scale isolation of protein complexes in SAR signaling (Blackstock and 
Weir, 1999). 

• Gene cloning 

During SAR, a major change in gene expression is induced, that is regulated by a 
complex network of genetic switches. Several of these switches have been mutated in the cim 
mutants and the genetic cloning will provide a valuable step in understanding plant disease 
resistance. Several candidate genes have been cloned during this work that await genetic 
confirmation. 

One central regulator gene of the SAR signaling cascade, NIMJ/NPR1 was cloned 
independently by two groups and seems to be part of a signal transduction cascade with 
homology to the mammalian IkB/NF-kB pathway (Ryals et al, 1997; Cao et al, 1997; 
Baeuerle and Baltimore, 1988). Interesting parallels to this conserved pathway in the innate 
immune response of Drosophila and mammals to the defense response of plants have been 
drawn (Belvin and Anderson, 1996). Besides the homology of several R genes to 
Toll/interleukin-1 receptors (Whitham et al., 1994), members of the kinase cascade may also 
be shared (PTO, FEN; Martin et al, 1994; Salmeron et al, 1996). Finally, small 
antimicrobial peptides, such as defensins, are well conserved among plants and insects 
(Hancock et al., 1995). Knowing more elements in the cascade will not only allow us to 
compare this wide-spread signal transduction cascade but also to study molecular interactions, 
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for instance using the two-hybrid system, and to study physiological effects of overexpression 
and repression of the genes. 

The applications of dm genes in agriculture are potentially very promising: Several 
genes involved in the regulation of various induced defense responses in plants have already 
been successfully used to engineer disease resistant plants, including the NIM1INPR1 gene 
and the NDR1 gene mArabidopsis (Cao et al, 1998). 

EXPERIMENTAL - MATERIALS AND METHODS 
1 . Chemicals, Enzymes and Radioisotopes: 

All chemicals and organic solvents are purchased, if not otherwise stated, from Sigma 
Chemicals (St. Louis, MO), Fisher Chemicals (Pittsburgh, PA), or BioRad (Hercules, CA). 
Chemicals for culture media are obtained from Difco Laboratories (Detroit, MI) or from 
GibcoBRL Lifescience (Gaithersburg, MD). Luciferin (potassium salt) is purchased from 
Biosynth (Staad, Switzerland), the detergent SilWet L77 is from Union Carbide Chemicals 
(Danbury, CT). 

Restriction enzymes, T4 DNA polymerase and T4 DNA ligase are purchased from New 
England Biolabs (Boston, MA) or, if not available from this provider, from Boehringer 
Mannheim (Indianapolis, IN) or Stratagen (La Jolla, CA). Lysozyme, bovine serum albumin 
fraction IV and V, and ribonuclease A are delivered by Sigma. For standard PCR, AmpliTaq 
Gold® from Perkin-Elmer (Foster City, CA) or PCR beads ("Ready-to-go™"; Pharmacia 
Biotech Inc., Piscataway, NJ) are used. Long range PCR is performed using a special enzyme 
mix, rTth DNA polymerase (Perkin-Elmer, XL-PCR kit). As DNA size ladder, either the 
lamba DNA-//mdm digest (1-23 kb), the phiX174 DNA-Z/aelH digest (0.1 - lkb), or the 1 kb 
ladder (1-10 kb) from New England Biolabs is employed. Plasmid minipreparations are 
prepared using Promega's wizard® miniprep kit (Promega Corp., Madison, WI). For 
maxipreparations, Qiagen's maxiprep kit (Qiagen Inc., Chatsworth, CA) is used. 

Nucleic acids are separated on agarose gels (low EEO, Sigma). For preparative gel 
electrophoresis, low melt SeaPlaque GTG agarose is used (FMC byproducts, Rockland, ME), 
for electrophoretic separation of SSLP genetic markers, Metaphor gels are used (FMC) and 
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for separation of smaller fragments, as well as for heteroduplex analysis, 10% 19:1 
polyacrylamide gels (BioRad) arc used. For documentation, nucleic acids separated on gels 
are photographed on Polaroid black and white print film, iso3000/36° (Cambridge, MA). For 
blotting, nucleic acids are transferred onto GeneScreen Plus membranes (NEN™ Life Science 
Products, Boston, MA) or Hybond™-N-f- membranes (Amersham, Arlington Heights, IL). 
Random primer DNA labeling mix is obtained from GibcoBRL Lifescience. Radioisotopes 
[<x- 32 P] dCTP are delivered by International Biotechnologies Inc. (New Haven, CT). 
Radioactive signals are visualized on Kodak X-OMAT film (Rochester, NY). 

2. Common Media and Buffers: 

For the preparation of the buffers and media, double-autoclaved water is used. Prior to 
use, the solutions are autoclaved or filter-sterilized through 2 \xrn filters (Nalgene, Rochester, 
NY). 

LB 10 g/1 Bacto tryptone 

5 g/1 Bacto yeast extract 
170mM NaCl 

SOC 20 g/1 Bacto tryptone 

5 g/1 Bacto yeast extract 
10 mM NaCl 
2.5 mM KC1 

GM medium (Murashige and Skoog, 1962) 

4.39 g/1 M&S standard medium 

10.0 g/1 sucrose 

0.5 g/1 MES 

1 mg/1 thiamine 

0.5 mg/1 pyridoxine 

0.5 mg/1 nicotinic acid 
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100 mg/1 inositol 



YT medium 



8 g/1 Bacto tryptone 
5 g/1 Bacto yeast extract 
90 mM NaCl 
pH to 7.3 
15 g/1 Agar 



YPDL medium 



1 0 g/1 Bacto tryptone 
5 g/1 Bacto yeast extract 
170mMNaCl 
20mMMgSO 4 

(for plates, supplemented with 15 g agar per liter) 



Kings B medium 3 8 g/1 Pseudomonas Agar F 
15 ml/1 Glycerol 

(for plates, supplemented with 15 g agar per liter) 

Media are supplemented with antibiotics, if needed: 50 mg/1 kanamycin, 50 mg/1 
ampicillin, 50 mg/1 rifampicin, 15 mg/1 tetracyclic 25 mg/1 chloramphenicol, or 30 mg/1 
hygromycin. 



IM medium 



4 39 g/1 M&S standard medium 
5.0 g/1 sucrose 

10 ml/1 6-benzylaminopurine, lmg/ml solution in 1% KOH 



MSE, 10 x 



0.2 M MOPS acid 

50 mM sodium acetate, pH 5.2 

lOmMEDTA 

pH to 7.0 with NaOH 
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TAE buffer, 10 x 



48.4 g/1 Tris base 

1 1.4 ml/1 acetic acid, glacial 

1 0 mM EDTA, pH 8.0 
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TBE buffer, 10 x 



0.89 M Tris-base 
0.89 M boric acid 
20mMEDTA,pH8.0 



TE, pH 8.0 (pH 7.5) 10 mM Tris-HCl, pH 8.0 (resp. pH 7.5) 
1 mM EDTA 

RNA sample buffer 50 ml formamide/bromophenol blue (10:1) 
1 7 ml formaldehyde 
10 ml lOxMSE 

0.4 ml ethidium bromide ( 1 0 mg/1) 



SSC, 20 x 



3 M NaCl 

0.3 M citric acid, trisodium salt 
pH 7.0, adjusted with 10 N NaOH 



STE buffer 



lOmMEDTA 
lOOmMNaCl 
20 mM Tris-HCl, pH 7.5 



CTAB buffer 



2% w/v CTAB 

100 mM Tris-HCl, pH 8.0 

20 mM EDTA 

1.4 M NaCl 

1%w/vPVP 40000 



Phenol is saturated with Tris-HCl pH 8.0 except for use in the Trypan Blue stain mix. 
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3 . Biological Materials 



A. Plants 

Arabidopsis thaliana (Heynh.) ecotypes Wassilewskija (Ws-0); Columbia (Col-0\ and 
Landsberg erecta (Ler) are obtained from Lehle Seeds (Round Rock, TX). A hygromycin 
resistant NahG line in the Col-0 background is used for crosses. 

B, Microorganisms 
E. coli strains 

DH5aF FY endAl 9 recA\ 9 gyrA96, thi-1, hsdRll (rkmk*), supE44 9 relA\ 9 

deoR, ($80dlacA(lacZ)Ml5) A(/ac/ZZ4-ar^F) (Hanahan, 1983) 

XL1 Blue FV lacl q A(lacZ)Ml5, end Al, recAl , gyrA96> thi-1, hsdRll (ik mk*), 
supE44 9 relAl (Bullock et al 9 1987) 

XL2 Blue MRF' F7 lacF A(!acZ)M15 9 endAh recAl, gyrA96 9 thi-1 9 hsdRll (rk* 
mk"), supE44 9 relAl 9 A(mcrA)l83, A(mcrCB-hsdSMR-mrr)lll 
(Stratagene) 



DH10B F" mcrA A(mcrCB~hsdSMR-mrr) endAl, recAl, gyrA96 9 thi-1 9 

hsdRll (rkmk*), supE44 9 relA\ 9 deoR, ($>80dlacA(lacZ)Ml5) 
A(lacIZYA-arg¥) (GibcoBRL) 

Yeast Strains 

Saccharomyces cerevisiae strain AB1380: Matcc,/?.s/+, ura3-52 9 trpl 9 ade2-l 9 canl- 
100, lys2-l 9 his5 (Burke et aL 1987) 

Agrobacterium tumefaciens 
Agrobacterium tumefaciens strain GV3101, containing the pMP90 vir plasmid 
(Koncz and Schell, 1986) 
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Pseudomonas syringae strains 

Pseudomonas syringae pv. tomato DC 3000, Rif (Whalen et al, 1991) 
Pseudomonas syringae pv. maculicola ES4326, Kan r (Dong et at., 1991) 



Peronospora parasitica 



pathovar 


susceptible A. thaliana ecotype 


Source 


Noco2 


Col-0 


J. Parker, Sainsbury Laboratory, Norwich, 






England 


Emco5 


CoUO 


B. Mauch-Mani, A. Slusarenko, Inst, fur 






Pflanzenbiologie, Zurich, Switzerland 


Emwa 


Ws-0 


E. Holub, L Crute, Horticultural Research 






Station, E. Mailing, England 



Erysiphe cichoracearum 

Erysiphe cichoracearum strain UCSC is provided by R. Innes (Indiana University, IN). 



C. Vectors 
Plasmids 

pUC19 colEl ori, lacZa, MCS, Amp r (Yanish-Perron et al., 1985) 

pBeloBAC-Kan ori S, Kan f , repE, par A parB, cosN, MCS, loxP, lacZ (Mozo et al. , 
1998) 

pBeloBAC 1 1 ori S, Chl r , repE, parAparB, cosN, MCS, loxP, lacZ (Kim et al, 
1996) 

P YAC4 URA3, SUP4, CEN4, ARSJ, TRP1, Amp r , pBR322, TEL HIS3 TEL 

(Burke al., 1987) 

pYAC41, dito, but modified restriction sites (Albertsen et al., 1990) 
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PCIB200 LB NosAtf>77/RB, oriT, oriV 9 Kan r 

Phagemids 

pBluescript SKH(+/-) colEl orijl ori> lacZ, MCS, Amp r ((Short et aL, 1988), and 
Stratagene) 

pHD-1 is identical to pBluescript but contains a polylinker cloned in 
the Notl site (Hofte et al , 1 993). 

pZLl colEl ori t fl ori, lacZ, lacIJncA, loxP, MCS, Amp r (GibcoBRL) 

pCR™2.1 colEl orUfl ori, lacZa, MCS, Amp r , Kan r (Invitrogen Corp., Carlsbad, CA) 

Libraries and stock collections 

YAC clones, BAC clones and BAC library filters (TAMU and IGF libraries) are 

obtained from the ABRC stock center (Ohio State University, OH). 

Cosmid library pOCAl 8 (binary vector, bacterial selection: Tet r , plant selection: 
35S/hyg r , 10 genome equivalents Ws-0 inserts, average size 25 kb; 
Schulz et al, 1994), obtained from ABRC stock center 

MSU est collection Amp r , lambda ZipLox (LifeScience BRL) Sali-Notl, or lambda 
ZAPII (Stratagene) EcdBl-Notl cDNA cloning (G.5 to 6 kb), obtained 
from the ABRC stock center as excised plasmids. 

4. General Laboratory Techniques 

All common molecular biological methods, as described for instance in (Ausubel et al. y 
1987; Sambrook et al 9 1989) are performed accordingly and are not described separately. 

A. Restriction endonuclease digest of DNA 

The endonucleolytic cleavage of DNA by restriction enzymes is carried out according to 
the manufactures specifications. For CAPS marker development, 5 ^1 of a 25 jil PCR reaction 
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is used in a 20 ^1 digest. For plasmid rescue and Southern blot analysis, including RFLP 
analysis, digests are performed in 200 }xl volume, using up to 3 [ig DNA. 

B. Purification of DNA fragments 

DNA fragments are either gel purified or, for PCR-generated fragments, purified in 
solution. For gel purification, small pieces of low-melt agarose are isolated and DNA is 
extracted using the Advantage™ PCR pure Kit (Clontech, Palo Alto, CA). For DNA in 
solution (e.g. PCR fragments), the GeneClean m kit (Bio 101, Inc., Vista, CA) is used 
according to the manufactures recommendations. 

C. Ligation 

Ligations of DNA fragments to vector DNA are performed according to standard 
protocols (Sambrook et aL, 1989). Usually, 50 to 100 ng vector DNA are mixed with a two- 
to threefold excess of fragment DNA in a 30 \il reaction volume. The reactions are performed 
overnight at 16°C for both sticky and blunt end ligations. One to 5 units T4 ligase are used 
per reaction. PCR fragments are cloned by TOPO-TA cloning following the instructions of 
the manufacturer (Invitrogen). 

For plasmid rescue, 2 ng of digested genomic DNA are resuspended in 358 \i\ water, 
supplemented with 40 ptl ligase buffer and ligated in presence of 2 ^1 T4 ligase (12 Weiss 
units). 

D. Preparation of heat-shock competent E. coli cells 

A 100 ml LB culture is inoculated with 0.5 ml of a liquid overnight culture of E. coli 
DH5a and grown with shaking at 37°C until an optical density (O. D. 6 oo) of 0.5 has been 
reached. Cultures are chilled on ice and cells are collected by centrifugation (5 min, 5000g). 
Cells are resuspended in 7.5 ml transformation buffer I (100 mM RbCl 2 , 45 mM MgCl 2 , 35 
mM potassium acetate, 10 mM CaCl 2 , 0.5 mM LiCl, 1 5% glycerin, pH 5.8) and incubated for 
1 0 min on ice. Cells are spun down as before and resuspended in 4 ml transformation buffer 
H (10 mM MOPS, 75 mM CaCl 2 , 10 mM RbCl 2 , 15% glyerin, pH 7.0). Aliquots of 100 ^1 
are rapidly frozen and stored at -80°C. 
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E. Transformation of competent E. coli cells 

Plasmids are transformed into E. coli using a modified version of the heat-shock 
protocol (Dagert and Ehrlich, 1979), An aliquot of competent cells is thawed on ice and 
incubated for 10 min with the DNA. After a heat pulse (1 min, 42°C), the cells are again 
incubated on ice for 2 min. 200 jxl SOC media at room temperature are added and the mixture 
is incubated at 37°C for an hour. Cells are plated on LB plates containing the selective 
antibiotics and in appropriate cases isopropylthiogalactose (IPTG) and 5-bromo-4-chloro-3- 
indoyl-P-D-galactose (X-Gat). 

For high efficiency transformation, Epicurian Coli® ultracompetent cells (E. coli XL-2 
Blue; Stratagene) are transformed following the manufactures instructions. 

F. Transformation of competent Agrobacterium cells 

To transform a binary vector into Agrobacterium, 40jil electrocompetent Agrobacterium 
cells are thawed on ice and 2 to 1 0 ng plasmid DNA is added. The mixture is transferred into 
aprechilled 0.2 ml electroporation cuvette (BioRad) and the cells are electroporated at 2.0 
Volts, 600 Ohms, 25 piFarad, 6 msec time constant using a Gene Pulser (BioRad), 
Immediately, 1 ml of 2 x YT medium is added and the suspension is incubated at 37°C for one 
hour under shaking. Cells are collected by centrifugation, resuspended in a small volume LB 
medium and spread onto LB plates containing the appropriate antibiotic (Kanamycin for 
pCIB200). Plates are inoculated 2 to 3 days at 28°C before inoculating 50 ml liquid LB 
cultures (supplemented with kanamycin and rifampicin) for transformation. 10 ml of this 
culture are used after 24 - 36 hours incubation at 28°C to inoculate 500 ml LB cultures. To 
verify that the plasmid is correctly transformed, plasmids are isolated following a lysis in 5 M 
NaCl, 20% sarkosyl solution and then the protocol of the Wizard plasmid minipreparation is 
followed (Promega, section 2.4.7). 

G. Plasmid minipreparation 

The preparation of small amount plasmid DNA is carried out following a method by 
Birnboim and Doly (1979). 3 ml overnight cultures are concentrated by centrifugation, and 
resuspended in 200 jlxI solution I (50 mM glucose, 25 mM Tris-HCl, pH 8.0, 10 mM EDTA). 
The solution is placed on ice and 200 nl of solution II (0.2 M NaOH, 1% SDS) are added to 
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lyse the cells. Cell debris is precipitated with 200 \il 5 M potassium acetate and separated 
from the supernatant by centrifugation. DNA is precipitated from the supernatant by 1/10 
volume sodium acetate and 2 volumes ethanol. The pellet is resuspended in 50 ^1 water. 

Alternatively, the Wizard plasmid minipreparation kit (Promega) is used for 3 ml 
overnight cultures according to the manufactures recommendations. 

DNA Sequencing 

DNA Sequencing is done according to the Sanger 2 %3*-didesoxy technology (Sanger et 
at. , 1 977), using the big Dye terminator ready reaction mix (ABI/Advanced Biotechnologies, 
Inc., Columbia, MD), supplemented with 2 mM MgCl 2j 80 mM Tris-HCl, pH 8.0 buffer. For 
DNA fragments bigger than 5 kb, transposon-mediated sequencing is carried out (Kimmel et 
al 9 1 997) using the primer island transposition kit (Perkin Elmer). All sequencing reactions 
are carried out in Peltier Thermal cyclers (MJ Research Inc., Watertown, MA) and loaded 
onto 5% acrylamide long-range gels (FMC ready mix). Fluorescence is read by an ABI Prism 
377 DNA sequencer (ABI) and bases are called using Phred/Frap/Consed software (University 
of Washington, Seattle, WA; Ewing et al, 1998; Gordon et aL, 1998). 

For sequence assembly and comparison, and restriction site mapping, the sequencer 
software (Gene Codes Corp., Ann Arbor, MI; Version 4.0 for Windows) is used. For 
similarity searches, BLAST2 software (Altschul et aL, 1990) are run at NCBI against 
GenBank (www/ncbi.nlm.nih.gov/BLAST/) or against the Arabidopsis thaliana database 
(http://genome-www2,stanford,edu/cgi-bin/AtDB/nph-blast2atdbV 

5. Plant Care 

A. Plant growth conditions 

A. thaliana (L.) Heynh. ecotypes Ler, Ws-0 and Col-0 are sown in 200-ml containers in 
an all-purpose soil mix (Germination Mix, superfine C. Fafard Inc.; Agawam, MA) that has 
been autoclaved twice for 70 min or once for 2 hours after 24 h hydration to allow fungal 
sporulation. The seeds are surface-sterilized with bleach (50% v/v commercial bleach, 0.01% 
v/v sodium dodecyl sulfate or another wettable agent) for 5 min and for 5 min with 80% 
ethanol, and washed several times in sterile distilled water before sowing. Plants are grown at 
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20 - 24°C, 60% relative humidity, 9 hr day/15 hr night (short day, SD), 250 jiE/m 2 s. Prior to 
germination, the flats are covered with plastic domes. For older plants, the soil surface is 
allowed to dry between waterings. Alternatively, plants are cultivated on GM-agarose in petri 
dishes in 0 ,1% GM top agar under sterile conditions and either SD or long day (LD; 15 hr day, 
9 hr night) in high densities (up to 1 0 plants per 1 cm 2 ). 

B. Pollination 

Cross-pollination of A, thaliana is performed on half-closed buds of flowers from the 
female parent plant. It is confirmed with the aid of a dissecting microscope that the anthers 
have not yet released pollen on the stigma. From the male parent plant, a dehiscing anther is 
removed with forceps and pollen is transferred to the stigma of the female parent. 

C. Transformation of Arabidopsis thaliana by Agrobacterium 

Plants are genetically transformed using an adapted protocol of vacuum infiltration 
(described by Bechtold et al (1993)). A. thaliana plants, which have bolted and just started 
flower formation (4-5 weeks old) are immersed in infiltration media (M) containing a 
washed 3 liter late log phase culture of A, tumefaciens GV3 101 (O.D. 600 = 0.7 or above), 
previously electrotransformed with a binary vector (section 4F). Infiltration is accomplished 
by creating and releasing a vacuum in the chamber containing the plants. Plants are then 
cultured as described above and the seeds are harvested and subjected to selection either on 
GM plates containing 50 mg/1 Kanamycin (Valvekens et al, 1988) or on soil with 3 to 4 spray 
treatments of 160 mg/1 Basta® (glufosonate ammonium) in the first two weeks of 
development in order to identify the transformants (Akama et al, 1995). 

D. Treatment of plants with chemical inducers of SAR 

Plants growing on soil are sprayed with S A (5 mM), INA (0.375 mM) or BTH solutions 
(0.1 - 5.0 mM, usually 0,3 mM) containing 0.01% v/v of a surface-active agent (SilWet L77). 
Plants cultivated on agar plates are exposed permanently to the inducers mixed in the GM- 
medium at different concentration ranges (BTH up to 250 |xM, INA up to 300 jaM and SA up 
to 150 fiM final concentration in the plate). 
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E. Luciferase activity determination in vivo 

To monitor luciferase activity on living plants, light emission is measured in a photon 
imaging device (Hamamatsu Inc. Tokyo) equipped with an ARGUS 50 photon-counting 
image-processor 15 min after the plants are sprayed with a luciferin solution (7.5 mM 
luciferin, 0,1% v/v Silwet 77). The most sensitive regulation and intensified CCD microscope 
video cameras (C2400-25 VIM) are used during signal integration for 10 min at 20°C. Plants 
that gave a positive measurement of light emission are identified, isolated and cultivated for 
further investigations. 

6. Pathogen Treatment 

A. Treatment with and culture of Peronospora 

To maintain fungal stocks, weekly transfers of P. parasitica are made. Transfers are 
accomplished by spraying oospores on compatible A. thaliana cultivars that are grown under 
high relative humidity (95%) at 1 5°C in a culture chamber. 

For pathogenicity tests, Peronospora oospores are isolated from infected A thaliana 
leaves by vortexing the leaves in distilled water. Spores are counted in a hemacytometer and 
the concentration is adjusted to 10 5 - 10 6 spores per ml. The supernatant is then used directly 
either to spray the planosphaere (Dietrich et al., 1994) or to infiltrate the leaf apoplast of 
Arabidopsis thaliana with a 1 ml syringe gently pressed onto the sub facial leafside. 

B. Erysiphe inoculations 

Resistance to Erysiphe cichoracearum strain UCSC is tested by brushing sporulating 
CoUQ plants onto four weeks old plants, as described by Frye and Innes (1998). Disease is 
scored 10 days after inoculation on at least 10 plants per mutant, according to the following 
rating: rating 1: 0-1 leaf per plant showed hyphal growth; rating 2: 2 - 4 leaves per plant 
showed hyphal growth; rating 3; more than 4 leaves per plant showed hyphal growth. Mean 
values and standard deviations are calculated for one experiment. The experiments are 
repeated three times. 
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C. Pseudomonas infections 

For the analysis of resistance to compatible phytopathogenic bacteria, the apoplast of 
leaves of four weeks old cim plants, BTH-activated Col-Q (0.3 mM, 2 days prior to infection) 
and water-treated Col-0 control plants are injected with Pseudomonas syringae pv. maculicola 
ES 4326 (Schott et al, 1990) or Pseudomonas syringae pv. tomato DC3000 (Dong et al, 
1 99 1) at 2 x 1 0 5 cells per ml Samples are taken at 0, 1, 3 and 5 days after injection. For each 
time point, four leaf punches are pooled, ground in 10 mM MgCl 2 and plated in appropriate 
dilutions on Kings B medium supplemented with rifampicin (50|ug/ml), or kanamycin (100 
Hg/ml). Standard deviations are calculated from four independent experiments. The 
significance of differences between mean values is evaluated by Student's t test. Differences 
are considered to be significant at P > 0.6. 

To monitor the response of cim mutants to avirulent Pseudomonas strains, high 
concentrations (5 x 10 7 cells per ml) of Pseudomonas syringae pv. tomato DC3000 avrPt2 or 
avrRpml (cloned into pVSP61) are infiltrated into leaves and symptom development is 
compared to wild-type and BTH pretreated plants over a period of 48 hours. 

Before each pathogenicity test, cim mutants are selected based on in vivo expression of 
the Pif-i/luciferase gene. For crosses of cim mutants to the NahG line, where .PR-7/luciferase 
expression is suppressed in all cases, resistance is evaluated on population level. 

7. Staining Methods 

A. Trypan Blue staining for cell death and fungal infection 

In order to detect lesions and fungal hyphae on leaves, a protocol by Keogh (Keogh et 
al. y 1980) is adapted for Peronospora (Koch and Slusarenko, 1990). Leaves are covered with 
prewarmed (65°C) Trypan Blue-stain mix (9.3 ml phenol, 1 0 ml lactic acid, 1 0 ml glycerol, 10 
ml Trypan Blue (1 mg/ml)). The dye is infiltrated by repeated vacuum application and release 
followed by brief boiling in a microwave. After an one-hour incubation at room temperature, 
leaves are cleared in a nearly-saturated (250% w/w) chloral hydrate solution overnight and 
slides are prepared the next day. 
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B. 3,3 , -dihexylocarbocyanine iodide [DiOC 6 (3)] staining for Erysiphe 

To visualize Erysiphe infection and fungal structures, a fluorescence dye staining is 
performed on infected leaves (Duckett and Read, 1991). Leaves are incubated for 2 min in 50 
Hg/ml [DiOC 6 (3)] stain (Sigma Chemical Corp.), cleared for 30 sec in distilled water and 
mounted in water under a coverslip. Fluorescent fungal hyphae are detected at 520 nra after 
blue light excitation (450 - 490 nm) with an epifluorescence microscope (Leitz, Wetzlar, 
Germany). 

C. Callose staining 

Callose depositions are detected using an aniline blue stain on 5 \xm thick leaf sections 
(Hunt et al, 1997). Leaves are fixed in 10% formaldehyde solution (45% ethanol, 10% 
formaldehyde, 5% acetic acid), and embedded in paraffin blocks. Microtom leaf sections 
(made by Experimental Pathology Laboratory, Durham, NC) are mounted on microscope 
slides and deparaffinated by two successive 5 min incubations in 100% xylene, two 5 min 
incubations in 100% ethanol and one 5 min incubation in each of the following: 75% ethanol, 
40% ethanol, and water. For callose staining, samples are incubated for 5 min in 0.15 M 
K 2 HP0 4 and 0,01% aniline blue. Samples are mount in 70% glycerol, 30% aniline blue stain 
and visualized using ultraviolet epifluorescence (390 - 430 nm) as described in Dietrich et al 
(Dietrich et al, 1994). 

8. Biochemical Methods 

A. Luciferase in vitro assay 

A modification of the extraction method described by Millar et al (1992) is used to 
assay luciferase activity in vitro. Plant tissue is ground in liquid nitrogen, resuspended in 500 
Hi prechilled grinding buffer (1 0 mM phosphate buffer, pH 7.5, 0.1 mM DTE), and 
centrifuged for 10 min (lOOOg, 4°C), 100 \x\ of the supernatant are mixed with 100 \x\ 
luciferin substrate (Promega Corp., Madison, WI or Biosynth International Inc., Naperville, 
IL) and light emission is integrated for 10 sec by a luminometer (Monolight 2010, Analytical 
Luminescence Laboratory; Ann Arbor, MI). 
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To determine the total protein content in the samples, the absorbence of 5 |xl of the 
supernatant is measured at 595 nm after the addition of 20% v/v protein assay-solution 
(BioRad; Hercules, CA) in a total volume of 500 

B. PAL enzyme activity assay 

0.5 g tissue, in presence of 50 ?g Dowex 1X2-200 (Sigma Chemical Corp.) and quartz 
sand are ground in liquid nitrogen. After addition of 1 ml extraction buffer (0.2 M Tris-HCl, 
pH 7.8, 14 mM P-mercaptoethanol), the solution is incubated 10 min on ice and the cell debris 
is separated by centrifiigation at 4°C (10 min, 20,000g). For the enzyme assay, 100 nl of the 
supernatant are mixed with 400 ^1 0.1 M Tris-HCl, pH 8.8, and 500 \il 20 mM phenylalanine 
(Sigma Chemical Corp.) dissolved in Tris-HCl, pH 8.8. PAL deaminates phenylalanine to 
cinnamic acid whose concentration can be measured by absorption spectroscopy at 290 nm 
(Xmax at 275 nm; e = 20000 cm^M" 1 ). The reaction is incubated at 30°C, and the O. D, 290 is 
measured at 10 min, 30 min, 60 min and 90 min after reaction start. PAL activity in jiKat/kg 
protein can be calculated according to: AE/h x 27.8/mg protein (Kombrink and Hahlbrock, 
1986). Protein content is determined as described above (8A) using the Bradford reagent. 

C. Salicylic acid extraction 

In order to measure the content of free salicylic acid and its monosaccharide conjugates, 
samples are harvested in triplicates and analyzed as previously described (Enyedi et aL, 1992; 
Uknes et aL, 1993). 0.3 g of frozen, ground A thaliana leaves are extracted with 3 ml 90% 
methanol during 20 minutes of sonication. After centrifiigation at 4000g for 20 min, the pellet 
is further extracted with 2 ml 100% methanol and then spun down again (4000g, 20 min), and 
the two supernatants are combined. Samples are split into two equal parts and dried in a 
speedvac. The first series of samples (free SA) is suspended in 2.5 ml 5% trichloroacetic acid 
(TCA; 5 min sonication) and SA is extracted twice with 2.5 ml extraction buffer. The second 
series (total SA) is first subjected to hydrolysis with 40 units P-glucosidase (Sigma Chemical 
Corp.) in 0.1 M sodium acetate (pH 5.2) at 37°C for 1.5 hours. 2 ml 5% TCA is then added 
and the acidic phase is extracted three times with 2.5 ml extraction buffer (ethylacetate : 
cyclopentane : isopropanol = 100 : 99 : 1). Each extract is dried in a speedvac, resuspended in 
150 yii 20% methanol, filtered in a spincolumn (Titan-MSF nylon microsample filters, 0.2 
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jim; SRI Scientific resources Inc.; Eatontown, NJ) and transferred to an HPLC autosampler 
vial. For chromatographic separation and measurements, 50 ^il of each extract is injected in a 
C-18 HPLC column (Dynamax 60, Rainin Instrument Comp.; Wobum, MA). SA is eluted at 
44°C in a 15 min linear gradient (solution A: 2 mM aqueous sodium acetate plus 5% - 30% 
solution B: 2 mM sodium acetate in 70% methanol, followed by 100% solution B and 
reequilibration for 5 min in 5% solution B) and detected by a fluorescence detector at the 
column end after excitation at X = 295 nm. Area values are converted to concentrations using 
a standard curve. 

D. Camalexin extraction 

For camalexin extraction, triplicates of 3 leaf discs each (Cork borer #1) are pooled and 
extracted in hot methanol according to Dr. J. Glazebrook (http. //genome- 
www.stanford.edu/yiraWdo^ Extracts are nm on a Si250 TLC 

plate (J. T. Baker) in ethyl acetate : hexane (87:13) mobile phase. Tissue induced by 
Pseudomonas syringae pv. maculicola ES 4326 harvested 5 days after inoculation served as a 
control for extractions, synthetic camalexin (provided by Dr. T. Maetzke, Novartis Inc.) 
served as a standard during thin layer chromatography. Camalexin is detected under a long- 
wave ultraviolet lamp. 

9. Extraction Of Nucleic Acids 

A. Extraction of plant RNA 

Total RNA is isolated from 1 g frozen, powderized leaf tissue that is ground to a fine 
powder in liquid nitrogen. The samples are resuspended in 2.5 ml RNA extraction medium 
(50 mM Tris-HCl, pH 8.0, 4% w/v /?-amino salicylic acid, 1% w/v 1,5-naphtalene disulfonic 
acid (Arcos Chemicals, NJ) and 2.5 ml water-saturated phenol (Lagrimini et aL> 1987). After 
addition of 2.5 ml chloroform, phases are separated by centrifugation (10 min at 7000g). The 
aqueous phase is transferred to a new tube and nucleic acids are precipitated with the addition 
of 1/10 volume 3 M sodium acetate, pH 5.2 and 2 volumes ethanol at -20°C for 30 min. 
Precipitates are spun down (10 min at 7000g) and the dried pellets are resuspended in 2 ml 
double-distilled water. RNA is precipitated overnight at 4°C with the addition of 1 .25 ml 8 M 
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LiCl. The precipitate is pelleted by centrifugation (10 min at 7000g) and the pellet is rinsed 
with 80% ethanol. RNA pellets are resuspended in 100 p.1 water and the absorbence at 260 
nm and 280 nm are measured in a spectrophotometer (UV-160 visible recording 
spectrophotometer, Shimadzu; Columbia, SC) to determine the amount and the purity of the 
RNA. 

For one-leaf extractions, a modification of the above protocol is used. The leaf is 
ground in liquid nitrogen, and 500 }il of an 1 : 1 mixture phenol : extraction buffer (0. 1 M LiCl, 
100 mM Tris-HCl, pH 8.0, 10 mM EDTA, pH 8.0, 1% SDS) at 80°C are added. After adding 
250 pi chloroform, the phases are separated by centrifugation and nucleic acids in the aqueous 
phase are precipitated with sodium acetate and ethanol at -20°C. The pellet is resuspended in 
8 jil water 

B. PolyA RNA purification 

PolyA RNA is enriched using a poly dT magnetic bead technique (Promega). One mg 
total RNA is hybridized to the poly dT-biotin nucleotides according to the manufactures 
recommendations. Avidin-coated magnetic beads allowed the separation of polyA RNA from 
non poly-adenylated RNA. The RNA is released from the poly dT probes in low salt buffers 
and concentrated by precipitation with ethanol. 10% of the obtained polyA RNA is used for 
spectrometric analysis. 

C. Plant DNA minipreparation 

Plant DNA is extracted using the CTAB-method described by Rogers and Bendich 
(1988). 1 - 2 leaves per sample are ground in liquid nitrogen with a Polytron (Brinkmann 
Instruments Inc. Westbury, NY), then vortexed with 200 pi 2 x CTAB buffer. After heating at 
65 °C for 15 min, 200 pi chloroform are added and the well-mixed extraction is centrifuged 
for 2 min (1 0,000g). DNA is precipitated from the resulting supernatant with 3 volumes of 
ethanol at -20°C. The precipitate is spun down (10 5 000g, 15 min) and the pellet is rinsed with 
70% ethanol. After drying, the pellet is resuspended in 30 pi 10 mM Tris-HCl, pH 8.5, 
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D. Plant DNA extraction for Southern blot analysis and pooled progeny analysis (F3 
populations) 

1 to 2 g ground tissue are mixed on ice with 12 ml extraction buffer (0.1 M Tris-HCl, 
pH 8.0, 50 mM EDTA, pH 8.0, 0.5 mM NaCl, 10 mM p-mercaptoethanol; Dellaporta et al y 
1983). After adding 0.8 ml 20% SDS, the extract is incubated at 65°C for 10 min. Cell debris 
is precipitated with 4 ml 5 M potassium acetate during a 20 min incubation at 4°C and 
separated from the supernatant by centrifugation at 4°C (10 min, 8000g). The supernatant is 
filtered through prewetted Miracloth and DNA is precipitated with 8 ml isopropanol at -80°C 
(30 min). DNA is pelleted, dried and resuspended in 4 ml TE and again precipitated with 
sodium acetate and ethanol. After resuspension in 400 jo.1 TE, an RNase A digest is performed 
for 10 min at 37°C (final concentration RNase A: 50 fig/ml). The samples are extracted once 
with a 1 :1 mixture of phenol : chloroform, once with chloroform, precipitated by addition of 
sodium acetate and ethanol and resuspended in 50 \i\ water. For less tissue (15 - 500 mg), the 
protocol is scaled down 20- fold and the RNase treatment is omitted. 10 (xl is used in 
restriction digests for Southern blot analysis. 

E. BAC DNA minipreparation 

BAC DNA minipreparations are done according to a protocol by Sinnett et al (1998), 
using a modified alkaline lysis method. 3 ml LB overnight cultures containing either 50 ?g/ml 
kanamycin (IGF BACs) or 12.5 fig/ml chloramphenicol (TAMU BACs) are pelleted and 
resuspended in 100 ^1 of chilled resuspension solution (25 mM Tris-HCl, pH 8.0, 50 mM 
glucose, 10 mM EDTA, pH 8.0). Cells are lysed by adding 200 jil lysis buffer (0.2 ATNaOH, 
1% SDS). Cell debris and chromosomal DNA are precipitated by 150 \x\ 5 M potassium 
acetate, pH 4.8. After a 5 min centrifugation at maximal speed in a tabletop centrifuge, the 
supernatant is transferred to a new tube and the crude DNA is precipitated by adding 2 
volumes of ethanol. The DNA is pelleted for 5 min as before, washed in 70% ethanol and 
resuspended in 100 nl of TE buffer containing 0.1% SDS and 100 ng/ml proteinase K, 
followed by a one-hour incubation at 37°C. The reaction is extracted with 100 ul phenol : 
chloroform (1:1), then with 100 \xl chloroform. The DNA is precipitated with 2 volumes 
ethanol and washed as before, then resuspended in 50 ^il water. 
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R BAC DNA maxipreparations 

To obtain bigger quantities of BAC DNA, a protocol provided by Choi et ah (1995) is 
used in a modified version. 2 liter overnight bacterial cultures (LB plus antibiotic) are 
harvested by centrifugation. The pellet is resuspended in a lysozyme solution (50 mM 
glucose, 10 mM EDTA, 25 mM Tris-HCl, pH 8.0, 5 ^g/ml lysozyme) and incubated on ice for 
5 min. 40 ml of an alkaline lysis solution (0.2 ATNaOH, 1% SDS) are added and after 5 min 
incubation at 4°C, 30 ml of ice-cold potassium acetate solution (5 M, pH 4.8) are added. 
After freezing at -80°C, the precipitate is pelleted (15 min, 15000g), the supernatant is filtered 
through miracloth, and DNA is precipitated by 0.6 volumes isopropanol at -80°C. DNA is 
pelleted (30 min, 15000g), dissolved in TE and subjected to RNase A digest (20 |*g/ml, 45 
min at 37°C). DNA is extracted with an equal volume phenol : chloroform (1:1), and with 
chloroform, then precipitated with sodium acetate and ethanol. DNA is taken up in 200 p.1 
water. 

Alternatively, the purification followed the Qiagen maxipreparation protocol for very- 
low-copy cosmids (Qiagen Inc., Valencia, CA). The DNA is taken up in 20 ^il of 10 mM 
Tris-HCl, pH 8.5. 

G* YAC DNA preparation 

To isolate yeast DNA, a protocol by Hoffman and Winston (1987) is used. Cells of a 
stationary 5 ml YEPD liquid culture are collected by centrifugation, washed in water and 
lysed in lysis buffer (2% Triton X-100, 1% SDS, lOOmM NaCl, 10 mM Tris-HCl, pH 8.0, 1 
mM EDTA, pH 8.0) in presence of 0.2 ml phenol : chloroform (1:1) and 0.2 ml glass beads 
under intensive vortexing. After adding 0.2 ml TE and centrifugation, the aqueous phase is 
extracted with chloroform and DNA is precipitated with two volumes ethanol The pellet is 
resuspended in 0.4 ml TE and the DNA is precipitated with 10 ^1 ammonium acetate and 1 ml 
ethanol. DNA is resuspended in 50 |xl TE. 
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10. Analysis Of Membrane-Bound Macromolecules 
A. Analysis of RNA 

For RNA blot analysis, 10 \ig of purified RNA (0.25 fig poly A RNA) in 15 ^1 water is 
heated for 15 min at 65°C after addition of 34 ^1 RNA sample buffer (65% v/v 
formamide:bromophenol bine (1:10), 21.5% v/v formaldehyde (37%), 13% MSE, 0.5% v/v 
ethidium bromide (10 mg/1)). The samples are loaded on a denaturing agarose-gel (1 .2% w/v 
agarose, 1 x MSE, 3% v/v formaldehyde) and run at 12 V/cm for 1 hour (Sambrook et ai, 
1989). The gel is blotted overnight in 6 x SSC (Ausubel et aL, 1987) on a GeneScreen Plus® 
membrane, and the efficiency of the RNA transfer is checked by viewing the blot under UV 
illumination. RNA is linked to the membrane by UV-crosslinking (Stratalinker®, Stratagene; 
La Jolia, CA) at 1200 \i J. After at least 1 hour prehybridization at 65°C (hybridization-buffer: 
500 mM NaP0 4) pH 7.0, 1 mM EDTA, 7% v/v SDS, 1% w/v BSA (fraction V); Church and 
Gilbert, 1984), 100 jxl of an [a- 32 P] dCTP labeled probe (random priming, Feinberg and 
Vogelstein, 1983) is added and the membrane is incubated overnight; Church and Gilbert, 
1984). The membrane is washed twice with washing buffer (40 mM NaPC>4 buffer, pH 7.0, 
lmM EDTA, 1% v/v SDS) containing 5 g/1 bovine serum albumin (BSA) (each 20 min, 65 °C) 
and once without BSA (1 5 min, 65°C). The blot is exposed for at least half an hour to a 
Phosphorlmager® screen (Molecular Dynamics; Sunnyvale, CA) and, depending upon the 
observed intensity, exposed for several hours to days on XAR-5 scientific imaging film at - 
80°C in the presence of an intensifier screen. The experiments are repeated at least twice for 
every probe. 
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Table 4: Clones Used As Probes For The Characterization Of Gene Expression In cim 



Mutants In Northern Blot Analysis And DNA Microarrays. 



Probe 


Clone Name 


Obtained 
From 


Reference 
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3 


(Wanner a/., 1995) 
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3 


(Shirley a/., 1995) 
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(Buschees et al . 1 997) 


iSlNl 1 


I11IJ1 ^UU lliUUvlUlv utimiLUMijr ) 


g 


(Ryals ^ a/., 1997) 


JvUaxJ 


■t\r\r\ fn. cnpriflf nicpncp T*f*C1 CtflTlf^P 
IHiIl~l acc-apcdilw UloCaaC icoioLain-'C 


3 


(Tenturv ^/ a/ 1997^ 




***** QA17^T7 
(?A7 7OHO 1 / 






DAU 


ocicnsc ugaiiisi ucain \s\jv%f\ 


4 


^SufHmnto /2/ 1 995) 










AJHAA 
1Y2L/JV1 


fvi/n—Vii^ViTirl iti tyrant rvr r*f D>i D 


4 


J. Oangl, unpublished 


LoJL/1 


lesion simulating uiacdoc rcbiautiivc 


A 


^Dietrich a/ 1997"! 




cDNA 








1f*t1ia1 leaf* ^not homolot? est 


3 


(Gray era/., 1997) 
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CAT2 CAT3 


catalase 2 and 3 est 38C1T7. est 
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peroxidase C, estB25XP 


4 


(Jabs etaL, 1996) 


CPER 


cationic peroxidase, est 40H1T7 


4 


(Jabs al., 1996) 
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glutathione-S-transferase typeHl, 


2 


(Marrs, 1996) 




est 90B13T7 
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PCR fragment 689 to 1 136 bp 


8 


(Delaney etaL, 1994) 
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responsive to abscisic acid cDNA 


5 


(Gosti era/., 1995) 


PAT1 


phosphoribosyl-anthranilate 


6 


(Rosee/tf/., 1992) 




synthase gene 






GluPER 


glutathione peroxidase, est 


3 


(Eshdate/^/., 1997) 
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SPS 


sucrose-phosphate synthase cDNA 


8 


D. Guyer, unpublished 
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protoporphyrin oxidase, cDNA 


8 


(Lermontova et aL> 1997) 


EPSPS 
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8 
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phosphate synthase qDNA 
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Ascorbate peroxidase, cDNA 


7 


(Kubo ^a/., 1992) 


GS 


Glutamine synthetase, cDNA 


7 


(Peternmn e/ a/., 1991) 


ATVSP 


vascular storage protein, est 


3 


(Bergeref al, 1995) 
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pathogen-inducible est 31G1 1T7 


3 


(Aufsatz and Grimm, 1994) 


BJQ2 


biotin synthase, cDNA 


7 


(Pattone/ a/., 1996) 
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1 P. Epple and H. Bohlmann, ETH Zurich, Switzerland 

2 A. Molina, Escuela Tecnica Superior de Ingenieros Agronomos, Madrid, Spain 

3 ABRC stock center, Columbus, OH 

4 J. Dangl, University of North Carolina, Chapel Hill, NC 

5 J. Giraudat, Institut des Sciences Vegetales, CNRS, Gif-sur-Yvette, France 

6 A. Rose, University of California, Davis, CA 

7 D. Patton, Novartis Crop Protection, Inc., NC 

8 Genes cloned in J. Ryals* laboratory 

B. Southern blot analysis 

Approximately 0.5 to 5 \xg DNA is digested overnight in 200 ul reaction volume. DNA 
is precipitated by sodium acetate and ethanol and resuspended in 20 jil water. To separate the 
restriction fragments, the DNA is loaded in presence of 4 ul loading dye onto a 0.9% TBE 
agarose gel (run either overnight at 1 V/cm, or 3 to 4 hours at 4 V/cm). The gel is soaked in 
0.25 NHCl for 20 min, then for 30 min in denaturation solution (1.5 M NaCl, 0.5 M NaOH) 
and twice for 20 min each in neutralization solution (3M NaCl, 0,5 M Tris-HCl, pH 7.5), as 
described by Ausubel et al (1987). The DNA is transferred overnight onto a nitrocellulose 
GeneScreen Plus® membrane (Du Pont-New England Nuclear) in 10 x SSC, as described 
above for Northern blots (section 10A). Hybridization and washing are performed as for 
Northern blot analysis. 

C. Dot blot analysis 

Dot blots are performed using a Bio-Dot apparatus (Bio-Rad) as described by Gorlach et 
al (1995). Two ug of total RNA are denatured in 6 x SSPE (20 x SSPE: 20 mM EDTA, pH 
7.4, 3 M NaCl, 0.2 M sodium phosphate, pH 7.4) containing 20% deionized formaldehyde for 
15 min at 55°C and then chilled on ice. Two volumes of ice-cold 15 x SSPE are added and 
the samples are applied to a GeneScreen Plus® membrane which is pretreated with 12 x 
SSPE. After crosslinking, prehybridization and hybridization are performed as described for 
Northern blot analysis (section 10A). 

D. Hybridization of BAC filters and colony filters 

BAC filters are hybridized according to the TAMU BAC filter manual (Version 2, 
http;//tamu.edu:8000/-creel/bacman2.html). New filters are prehybridized at 65°C twice for 8 
hours in prehybridization buffer (0.5 M NaHP0 4 , pH 7.2, 7% SDS, 1% BSA, (fraction V), 1 
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mM EDTA, 10 mg/ml sheared salmon sperm DNA), used filters only once. After adding the 
probe, hybridization took place in the same buffer for 1 8 to 36 hours at 65°C. Filters are 
washed with 0.5 x SSC, 0.1% SDS 3 times for 20 tnin at 65°C and exposed on a 
Phosphorlmager screen (MolecularDynamics) as described above. 

E. Western blot analysis 

For Western (protein) blot analysis (Towbin et al, 1979), proteins from 0.25 g 
pulverized tissue are extracted in 500 pi extraction buffer (0.25 mM Tris-HCl, pH 6.8, 4.5 M 
Urea, 2% SDS, 5% p-mercaptoethanol). Equal amount of protein (determined by Bradford 
reagent, section 2.8.1) are loaded onto a 10% Tris-glycine gel (No vex, San Diego, CA) and 
run at 10 V/cm for 1 to 2 hours in 1 x running buffer (25 mM Tris-HCl, pH 8.3, 250 mM 
glycine, 0.1% SDS). The gel is soaked for one hour in equilibration buffer (20 mM Tris-HCl, 
pH 8.0, 150 mM glycine, 20% methanol) and proteins are transferred by electroblotting onto a 
nitrocellulose membrane (100 V, 1 hour constant current, Novex X-Cell II Blot module). The 
membrane is washed for 10 minutes in wash buffer (1 x PBS, 0. 1% Tween-20), then 
incubated for one hour in blocking buffer (1 x PBS, 0.1 % Tween-20, 5% milk powder). The 
PR-1 specific antibody is bound at 4°C overnight in incubation buffer (1 x PBS, 0.1% Tween- 
20, 1 % milk powder) and unspecifically bound antibodies are removed by washing the 
membrane four times in wash buffer. The second antibody (antilgG rabbit conjugated with 
alkaline phosphatase) is bound for 3 hours at room temperature in incubation buffer and the 
membrane is washed as before. The protein is detected using the NBT (p-nitroblue 
tetrazolium) method as described by Harlow and Lane (1988). 

1 1 . PCR Protocols And Genetic Markers 

A. SSLP PCR conditions 

Polymerase chain reactions (PCR) for microsatellite amplification (SSLP) are carried 
out following a protocol of Bell and Ecker (1994) in a Gene Amp PCR System 9600 or 
GeneAmp PCR system 9700 (Perkin Elmer-Roche Corp. Foster City, CA) using a final 
volume of 25 jil. Each reaction contained 5 jul of 10 fold diluted DNA from the DNA 
minipreparation (see above, section 2.9.3), 2.5 \x\ PCR 10 x reaction buffer (Perkin Elmer), 2 
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\il of a 10 mM dNTP stock solution, 1 pi of forward and reverse primers (20 ?M primer 
stocks), 03 pi AmpliTaq® gold DNA polymerase (5 U/ml, Perkin Elmer) and water qs. 25 pi. 
A typical reaction temperature cycle is: 10 min at 94°C, 40 cycles of 15 sec 94°C, 15 sec 55°C 
and 30 sec 72°C, then a last polymerization step at 72°C for 10 min. 3 pi loading dye are 
added and 15 pi of each reaction mix are loaded on a 4% metaphor intermediate melting 
agarose TBE gel (containing 2.5 x 10" 3 % v/v ethidium bromide). Gels are run for 2 h at 125 V 
in Ix TBE. Bands are observed under UV-illumination and documented with Polaroid 
3000/36° instant film. 

B. CAPS marker PCR conditions and other PCRs 

For CAPS markers, the PCR conditions are more variable depending on the length of 
the expected fragment size (2 min for 2 kb) and the melting temperature of the primers. A 
typical thermocycle program is: 10 min at 94°C 9 35 cycles of 30 sec 94°C, 30 sec 56°C and 2 
min 72°C, then a last polymerization step at 72°C for 10 min (Konieczny and Ausubel, 1993). 
If PCR beads are used (Pharmacia Biotech), only 2 min at 94°C are used prior to thermo 
cycling. Pooled PCR samples are used for restriction digest of fragments. For sequenced 
PCR fragments, restriction fragment polymorphisms are identified using the dCAPS software 
(Neff et al, 1 998) rather than by random trial. 

For sequencing, PCR fragments are pooled from at least two PCR reactions to minimize 
sequence differences generated during PCR, purified in solution (section 2.4.2) and diluted to 
a concentration of 100 (ig/mL Alternatively, PCR fragments are TA-cloned into pCR2.1- 
TOPO following the instructions given by the manufacturer (Invitrogen). 

C. Linkage analysis 

A set of 30 SSLP and CAPS primer pairs (see Appendix) is used on segregating F2 
populations to establish an initial map position. Genetic map distances are determined using 
MAPMAKER 3.0 b (Lander et ah, 1987; Lincoln et al 9 1992) run on a Sun SPARC 
workstation. Recombination frequencies are calculated using the MAPMAKER F2 algorithm 
and converted to map distances in centiMorgans (cM) using the Kosambi function (Kosambi, 
1944). 
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D. Long range PGR 

Long range PGR is performed on 0.5 ^g genomic DNA (Mundy et aL, 1 995). The 
lower phase (40 contained final concentrations of 1 x buffer (Perkin-Elmer), 2O0^M of 
each dNTP, 1.25 mM MgOAc, and 1.5 nM of each primer. It is covered with a wax bead and 
heated at 80°C for 5 min, then chilled to 20°C. The upper phase (60jil), containing 450 pM 
DNA, 4 Units rTth polymerase (Perkin Elmer) and 1 x buffer, is added on top of the lower 
phase, Thermocycling is as follows: 94°C, 1 min, 16 cycles (94°C 30 sec, 68°C 10 min), and 
14 cycles (94°C 30 sec, 68°C 10 min with 15 sec extension every cycle), 72°C, 10 min. 10 pi 
of the reactions are analyzed on 0.9% agarose gels. 

E. Other techniques for polymorphism detection 

To identify single nucleotide polymorphisms in short DNA fragments (up to 1 kb), two 
additional techniques are used, the heteroduplex analysis on high resolution gels (Hauser et 
aL 9 1998) and the RNase cleavage assay (Myers et al, 1985). 

For heteroduplex analysis, equal amounts (20 - 50 ng) of PCR fragments from two 
different A thaliana ecotypes are denatured at 95 °C for 5 min and cooled slowly to 42°C in 
the heating block. 5 pi of the samples are mixed with 1 pi triple dye (xylene cyanol, 
bromophenol blue, orange G; FMC Bioproducts), and applied onto a 0.7 mm thick lx MDE 
gel (FMC Bioproducts), containing 2.5 M urea. Electrophoresis is performed in 0.6 x TBE 
buffer using an adjustable slab gel unit (CBS Scientific, Del Mar, CA) with constant voltage 
(15 V/cm). After the run, the homo- and heteroduplexes are visualized by staining the gel 
with ethidium bromide. As a positive control, marker F21i5sp6, as described by Hauser et al 
(1998) is used. 

For RNase mismatch detection, genomic fragments (200 bp - 1 kb) are amplified from 
two A. thaliana ecotypes using PCR primers with incorporated T7 (5'taatacgactcactataggg - 
SEQ ID NO:l) and SP6 (5'atttaggtgacactatagga - SEQ ID NO:2) promoters. Using the Sp6 
and T7 RNA polymerase, both sense and antisense RNA probes are made, according to the 
manufacturers instructions of the MisMatch Detect™ II kit (Ambion, Inc., Austin, TX). Equal 
volumes of SP6 transcripts are mixed to T7 transcripts of the other ecotype, heated at 95°C for 
three min and cooled to room temperature. Different RNase digestions of the homo- and 
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hetero-RNA duplices are performed as recommended. Digestion products are analyzed on a 
2% agarose gel (5 V/cm, 1 x TBE, 25 min). 

F. Preparation of BAC ends by inverse PCR 

To generate probes from BAC ends, we modified a protocol described by Mozo et ah 
(1998). 1/10 of a pBelo BAC DNA minipreparation is digested with either EcoRV, MwcII, 
Rsal or Eagl (T7 end), or with Hhal, or HaeJR (Sp6 end). The digest is extracted with phenol 
: chloroform (1:1), and precipitated with sodium acetate and ethanol. Ligation is performed in 
a 100 |xl volume for at least three hours at room temperature. Ligase is heat inactivated at 
70°C (15 min), DNA is precipitated by ethanol, and circularized DNA is cleaved with Pvul 
(T7 end) or BsrBl (Sp6 end) in a 10 nl volume reaction. For amplification of the pBelo 
flanking DNA, standard PCR is performed with 56°C annealing temperature and 2 min 
extension time at 72°C, using for the T7 end the primers: 5'ttcccaacagttgcgcagc (SEQ ID 
NO:3) and 5'tcttcgctattacgccagct (SEQ ID NO:4), and for the Sp6 flanking DNA, the primers: 
5'tcacacaggaaacagctat (SEQ ID NO:5) and 5'acacaacatacgagccggaa (SEQ ID NO:6). PCR 
fragments are purified as described in section 2.4.2 and sequenced. 

G. Thermal asymmetric interlaced PCR of T-DNA insert junctions 

To clone the flanking genomic DNA of T-DNA inserts, thermal asymmetric interlaced 
PCR (TAIL PCR) is used as described by Liu et ai (1995). One out of six low stringency 
primers is used in successive PCR with three nested high stringency primers on either the 
right border, or the left border of the T-DNA. Reactions are performed on 5-fold diluted 
CTAB DNA minipreparations. The products of the second and third PCR are analyzed on 
agarose gels. If a small size difference between the second and the third PCR fragment is 
detected, the product of the third PCR is either purified for direct sequencing or cloned for 
sequencing by TA cloning into the vector pCR2.1 . 
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12. Genomics technologies 

A. Arraying libraries 

Est stock cultures are duplicated from liquid cultures by transferring cells into 96 well 
flat bottom culture plates (Falcon), containing LB freezing buffer (LB supplemented with 36 
mM K 2 HP0 4 , 13.2 mM KH 2 P0 4 , 1.7 mM sodium citrate, 0.4 mM MgS0 4 > 6.8 mM 
(NH 4 ) 2 SC>4, 4,4% (v/v) glycerol) with a disposable 96 needle inoculation tool. After growth 
overnight at 37°C, the cultures in microtiter plates are sealed using self-adhesive plastic seals 
(USA Scientific Plastics) and stored at -80°C For PCR, cultures are diluted 1:100 in 10 mM 
Tris-HCl, pH 8.5 and 10 jil of the dilution per 50 \A PCR reaction is used. To fill 96-well 
plates with liquid a Qfill2 machine (Genetix, Christchurch, GB) is used. To pick colonies 
grown on plates into a 96- or 384-well format, or in an array on a membrane, a Qpix robot 
(Genetix) is used. Membranes are put onto an agarose plate and colonies grew overnight. 
Cells are lysed on the membrane, and DNA is fixed on the support as described by Nizetic et 
al (1991). 

B. Two-dimensional protein gel electrophoresis 

For protein gels, 0.5 g frozen, pulverized tissue is suspended on ice in 200 \il SDS 
boiling buffer (5% SDS, 5% p-mercaptoethanol, 10% glycerol, 60 mM Tris-HCl, pH 6.8). 
The sample is spun down at 4°C at lOOOOg in a tabletop centrifuge and the supernatant is 
transferred into a new tube and kept on ice. Protein concentration is determined as described 
before (section 8A) using the Bradford reagent. A total amount of 100 \ig protein is loaded. 
Two-dimensional (2D) gel electrophoresis is performed according to the method of OTarrell 
(1975) by Kendrick Labs, Inc. (Madison, WI) as follows: Isoelectric focusing (IEF) is carried 
out in glass tubes of inner diameter 2.0 mm, using 2.0% pH 4 - 8 ampholines (BDH from 
Hoefer Scientific Instruments, San Francisco, CA) for 9600 volt-hrs. One ng of an BEF 
internal standard, tropomyosin protein, with lower spot of Mr 33,000 and pi 5.2 is added to 
the samples. An arrow on the stained 2D gels indicates this standard. After equilibration for 
10 min in Buffer 'O' (10% glycerol, 50 mM dithiothreitol, 2.3% SDS and 63 mM Tris-HCl, 
pH 6.8) the tube gels are sealed to the top of stacking gels which are on top of 10% 
acrylamide slab gels (0.75 mm thick) and SDS slab gel electrophoresis is carried out for about 
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4 hrs at 12.5 mA. For silver staining, the slab gel is fixed in a solution of 1 0% acetic acid/ 
50% methanol overnight. The following proteins (Sigma Chemical Corp.) are added as 
molecular weight standards to the agarose which sealed the tube gel to the slab gel: myosin 
(M r = 220.000), phosphorylase A (M r = 94,000), catalase (M r = 60,000), actin (M r = 43,000) 
carbonic anhydrase (M r = 29,000) and lysozyme (M r « 14,000), These standards appear as 
horizontal lines on the Coomassie Brilliant Blue R-250 or silver stained acrylamide slab gels. 
For Coomassie Brilliant Blue staining of the protein bands, the polyacrylamide gel is soaked 
in the staining solution (0.1% w/v Coomassie Blue, 16% v/v acetic acid, 42% v/v methanol) 
for several hours. The gel is destained in aqueous acetic acid (12% v/v isopropanol, 16% v/v 
acetic acid. 

For silver staining (Switzer et al, 1979), the gel is fixed at room temperature 
successively in 20% (w/v) trichloroacetic acid for one hour, twice for 30 min in 40% (v/v) 
ethanol, 10% (v/v) actic acid and twice in water for 20 min. The gel is soaked for 30 min in a 
10% (w/v) glutaraldehyde solution, followed by 3 washes in water (20 min each). The 
proteins are stained for 30 min in silver diamine solution (freshly made up 0.26% (w/v) 
NaOH, 1 .8% (w/v) ammonia, 3% (w/v) silver nitrate), washed three times in water and 
developed for 10 min in developing solution (0,05% (w/v) citric acid, 0.02% (v/v) 
formaldehyde) then transferred into stop solution (40% (v/v) ethanol, 10% (v/v) acetic acid). 

The stained gels are dried between sheets of cellophane. 

C. DNA microarray 

To amplify the Arabidopsis cDNA inserts cloned into Ml 3 derived plasmids, a PCR on 
colonies is performed. Bacteria are subcultured in 96 well format plates overnight in LB 
freezing media (supplemented with Amp). Aliquots of the cultures are diluted 1 : 100 in 10 
mM Tris-HCl, pH 8.5 and 10 ul of the dilutions are used per 50 \i\ PCR. The PCR mix 
contained per 50 ul reaction 5 \k\ AmpliTaq buffer (Perkin Elmer), 10 ul dNTP mix (20 mM 
each), 2 times 5 ul modified primers (M13 forward: 5*amino tgtaaaacgacggccagt - SEQ ID 
NO:7, M13 reverse: S'amino ggaaacagctatgaccat - SEQ ID NO:8, 10 uM each), 1 ul 
AmpliTaq Gold (Perkin Elmer). PCR thermocycling is performed in a Perkin Elmer 9700 
PCR machine as follows: 1 0 min at 95°C, 40 cycles of 30 sec at 95°C, 30 sec at 5 1 °C, 2 min 
extension at 72°C, followed by a 1 0 min extension at 72°C. 5 |il of all PCR reactions are run 
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on a L2% agarose gel (1 x TBE, 6 V/cm), After further purification through QIAquick-96 
microfiltration columns (Qiagen) and lyophilization, PCR products are resuspended in 10 
of 3 x SSC and spotted onto silane-coated glass slides (Synteny, Inc., Fremont, CA). The 
DNA is rendered single stranded by heat or alkali treatment. 

For probe preparation, single strand reverse transcription from an oligo-dT primer is 
performed in presence of Cy3-dCTP or Cy5-dCTP (Amersham, Arlington Heights, IL) using 
600 ng polyA RNA per sample. The reverse transcription reaction is performed in a 25 pi 
volume with 2 jig oligo(dT) 21-mer, 500 each of dATP, dGTP and dTTP, 280 yM dCTP, 
40 nM of Cy3 dCTP or Cy5 dCTP, 40 units RNAsin (Promega) and 200 units Superscript II 
reverse transcriptase (Life Technologies) in 1 x Superscript first strand buffer. Three specific 
control polyA RNAs (YCFR06, YCFR22 and YCFR23) from non-coding yeast genomic 
DNA are added into the reverse transcription reaction at 0.006 ng, 0.06 ng and 0.6 ng, 
respectively (ratios of the control RNA to polyA RNA are 1:100,000, 1:10,000, and 1:1000 
(w/w), respectively). After incubation at 37°C for 2 h, the reactions of two samples (Cy3 and 
Cy5) are combined and treated with 5 jjl of 0.5 M sodium hydroxide and 5 ^1 of 10 mM 
EDTA for 10 min at 65 °C to stop the reaction and degrade the RNA. Probes are purified 
using two successive Chroma Spin 30 gel filtration spin columns (Clontech) and lyophilized. 
Probes are resuspended in 20 jil hybridization buffer (5 x SSC, 0.2% SDS) and applied to the 
microarray (Schena et al, 1996). Hybridization is carried out at 60°C for at least 12 hours 
(Synteni). The slide is rinsed for 5 min each in 5 x SSC, 0.1% SDS and in 0.2 x SSC, 1% 
SDS at room temperature. Two-channel, confocal laser microscopes are used to scan the 
fluorescence emission after excitation at 532 nm and 633 nm (Shalon et ah, 1996). Local 
background in the surrounding area of each signal is automatically subtracted and for each 
experiment, a normalization over all signals is carried out. The expression quantification is 
scaled from -100 to +100 based on the three control RNAs and on consideration of various 
complex tissue and standard elements (further information can be found at 
http://www t synteni.com/client/controls/index.html). The indicated detection limit of the DNA 
microarray is 1 molecule per 100,000 RNA species (which equals 1 copy per cell) and the 
differential sensitivity can be as good as twofold. The raw data are fed into GemTools 
software for preliminary characterization. Extensive comparisons between gene groups are 
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done in Excel (Microsoft Corp., Redmond, WA), using the S Plus dist function for similarity 
calculations (Version 3.3, Mathsoft, Inc., Seattle, WA). 

D. Diversity of conditions used to describe the transcriptome of Arabidopsis thaliana 
during SAR. 

1 8 inducing conditions are as follows: Two-weeks old plants grown on herbicide (0.9 
\iM hydantocidin (1)), or adenylosuccinate synthetase-antisensed plants (2); leaf tissue from 
four-week old mutants cim6 (3), cim7 (4), cimll (5a); for comparison of reproducibility, 
sample cimll was also inversely labeled with the flourochroms (5b); NIM1 overexpressing 
line 8C (6); a NahG plant line (7) ; tissue from crosses of NahG to cim6 (8) and cimll (9); 
systemic, secondary tissue (10) and primary, infected tissue (11) of NahG plants, locally 
inoculated with incompatible pathogen Pseudomonas syringae pv. tomato (Pst) DC3000 
avrRpml at 106 cfu/ml harvested 44 hours after inoculation; tissue harvested after 4 hours 
(12) or 48 hours (13) after treatment with 0.3 mM BTH; tissue harvested 48 hours after 
inoculation with a suspension of 105 spores/ ml of the compatible isolate of Peronospora 
parasitica pv. Emwa (14) and of infected primary (15) and systemic secondary (16) tissue of 
wild-type plants harvested 44 hours after inoculation with Pst DC3000. avrRpml ; and 
systemic tissue oiniml-4 plants (Ryals et al, 1997) treated with Pst DC3000 avrRpml as 
above (17); tissue harvested 48 hours after inoculation with a suspension of 105 spores/ ml of 
the incompatible isolate of Peronospora parasitica pv, Noco2 (18). 

EXPERIMENTAL - RESULTS 

1 „ PR-l/Luciferase Line Construction And Characterization 

A. PR-1 phylogeny 

The expression of the PR-1 gene is the most reliable marker for the onset of SAR in 
Arabidopsis (Uknes et al, 1992). The PR-1 gene encodes a small (preprotein: 17677 Da, 161 
aa; cleaved: 14880 Da, 135 aa), acidic (pi of approximately 4.0) apoplastic protein. Although 
the function of the PR-1 protein remains unknown, several studies have shown that PR-1 
might play a direct role in conferring resistance to fungal pathogens. PR-1 has an 
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antimicrobial activity in vitro and confers resistance to oomycetes when overexpressed in 
planta (Alexander et al, 1993). Because of the central role of the PR-1 gene for this study, a 
similarity BLAST (Altschul et al 9 1990; Gish and States, 1993) search against the 
Arabidopsis PR-1 protein sequence was performed in Genbank (www.ncbi.nlm.nih.gov/htbin- 
post/Entrez, default parameters). PR-1 protein homologs were found not only in di- and 
monocotyledonous plants, but also in mammals and insects: PR-1 belongs to a family of 
cystein-rich secretory proteins (CRISPs) that groups mammalian SCP/TPX1 (sperm coating 
glycoprotein/testis specific protein) insect AG3/AG5 (venom allergen), fungal SC7/SC14 
(Schizophyllum commune), and plant PR-1 proteins together. The sequence identities of the 
homologs range between 30% and 80%. While the function of the human PR-1 homolog is 
less well understood, in insects, PR-1 -like proteins make up a major venom allergen. The 
close sequence homologies of the PR-1 protein to secreted cystein-rich proteins suggest a 
possible function of PR-1 in defense similar to other small antimicrobial peptides, such as 
plant defensins or thionins. This finding is consistent with the previously observed effects of 
PR-1 on fungal pathogens. 

B, Cloning and characterization of the transgenic PiW/luciferase line 
A 4258 bp upstream Xhol fragment of the Arabidopsis PR-1 promoter was cloned in 
front of the luciferase coding region in the binary vector pCIB200 {see, U.S. Patent 
Application No. 08/992,801 and U.S. Patent No. 6,03 1,153). Stable transformants in the 
ecotype Col-0 were selected on GM plates containing kanamycin. Segregation was analyzed 
in the T2 populations of selfed primary transformants (Tl). Several lines that segregated in a 
3:1 ratio were chosen for preliminary in vivo counting of background (non-induced) luciferase 
activity and chemically induced activity. 

One line, called 6E line, was chosen for further characterization. In a F2 population of 
an outcross to untransformed Col-0 plants, 147 out of 203 plants survived on selection for 
kanamycin resistance. The % 2 analysis for a 3:1 segregation ratio calculates to % 2 = 0.724, P < 
0.4. Southern analysis using either the luciferase gene or the right border of the T-DNA (RB) 
as a probe showed that only one insert was integrated into the genome. In particular, the 
timing of the expression of the transgene in the 6E line was expected to match the expression 
pattern of the endogenous PR-1 gene following both biological and chemical inductions. 
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I Chemical induction of the PR- 1 /luciferase gene 
For determination of chemical inducibility of luciferase activity, Fi^-7/luciferase plants 
were sprayed with three chemical activators of SAR: SA (5 mM), INA (375 pM) or BTH (375 
^iM or 5 mM) and luciferase activity was determined every 24 hours during a period of four 
days. For each measurement, six samples consisting of six leaves each were harvested. INA 
and BTH treatment at the standard concentration of 375 pM caused an induction of luciferase 
activity of more than 2000 fold within 48 hours and this level was maintained for at least two 
more days. In contrast, treatment with 5 mM SA lead to a 5000 fold induction of luciferase 
activity within 24 hours which was followed by a decline, possibly due to degradation or 
conjugation of SA. Treatment with 5 mM BTH caused an induction similar to SA but no 
pronounced decline of activity was observed within the first four days. RNA was extracted 
from the same tissue samples as used for determination of luciferase activity and PR- J mRNA 
levels were examined by Northern blot analysis. Luciferase mRNA is very unstable in plants 
(no plant polyadenylation signal was added, as the firefly luciferase gene possesses its own 
terminator sequence which is active in plants; Ow et a/., 1986) and can not readily be detected 
in Northern blot analysis. Because of the rapid turnover of the luciferase enzyme, its activity 
reflects roughly its transcription rate. Patterns of PR- 1 mRNA and luciferase activity were 
comparable, showing that luciferase activity reflects PR-1 transcription in these plants. To 
test whether the onset of SAR gene expression could be monitored in vivo, P7?-7/luciferase 
plants were treated with 375 yM BTH and sprayed with luciferin 2 days after treatment. Light 
emission of seedlings could be detected in vivo within 2 minutes using the photon-counting 
device. 

II. Biological induction of the PTJ-i/luciferase line 
To determine the systemic activation of PjR-7/luciferase subsequent to pathogen 
inoculation, /^-7/luciferase plants were sprayed at 24 and 12 hours before the pathogen 
treatment with 7.5 mM luciferin to inactivate luciferase (Millar et aL, 1992) and to reduce 
background induction. At 0 hours, approximately 50% of the area of fully developed leaves 
were infiltrated with either water or with a spore solution of P. parasitica Emwa (Table 5). 
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The incompatible interaction triggered a more than 150 fold systemic induction of the 
luciferase activity within three days. This induction could also be followed in vivo. 

Table 5: Induction of the P/J-7/luciferase transgene after treatment with the avirulent 
pathogen R parasitica Emwa (Emwa). Luciferase activity was inactivated 24 and 12 hours 
before the experiment by luciferin treatment. Values represent relative inductions of 
luciferase in vitro activity compared to water treated plants (set to 1). Each time point 



consists of 20 leaves infiltrated with either water or a spore solution of P. parasitica Emwa 
(10 5 spores per ml) derived from 10 different plants. 



Treatment 


0 hours 


12 hours 


24 hours 


72 hours 


H z O 


1+/-1 


1+/-0 


1+/-0 


1+/-0 


Emwa 


1+/-0 


3+M 


11+/-5 


1 62+7-68 



2. Mutagenesis And Mutant Screen 

In order to identify mutants that constitutively express the P#-Muciferase gene, the 6E 
line was submitted to EMS mutagenesis. Mutant screens in A. thaliana are usually performed 
in the M2 generation. In the Ml generation, the mutations are heterozygous and the plants are 
chimeric, since in A. thaliana seeds, at the time of the chemical mutagenesis, 12 cells 
represent the origin of the vegetative parts of the Ml plants. The size of the Ml population is 
dictated by the cost of the mutagenesis on the one hand and the size of the genome, and the 
mutation rate of the mutagen (mutations per genome and generation) on the other hand. In 
matured thaliana seeds, two diploid cells contribute to the formation of the inflorescence 
(Redei, 1975) and thus to the germ line (so called GECN = genetically effective cell number). 
We used 8400 Ml plants, corresponding to 4 x 8400 genome equivalents, pooled in 168 lots. 
This corresponds to roughly 33600 x 80 x 4 = 10,752,000 point mutations in the Ml 
population, based on the empirical assumption that about 80 homozygous point mutations per 
M2 plant are generated at our mutation rate (M-value = 0,147; Mednik ? 1988: Haughn and 
Somerville, 1987). This number has to be compared to 21,000 to 25,000 genes of the A. 
thaliana genome (Goodman et al, 1995). 

The required size of the M2 population can be derived from the size of the Ml to 
maximally exploit the genetic potential of the Ml . One has to consider that M2 seeds were 
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pooled from approximately 50 independent Ml lines, and that only half of theM2 siblings of 
a Ml plant can potentially carry one certain mutation (GECN = 2). For the detection of a 
recessive mutation (p = 0.25 in the M2), we expect therefore one plant out of 400 progenies 
(0.5 x 0.25 x 1/50) to show the mutational phenotype. To detect this plant with a high 
probability (P = 0,95), at least n plants per lot need to be screened, with 

n-ln(l-P)/tn(l-l/E) 

n: number of individual plants to be screened in each lot to 

detect any given mutation 
E: expectation value, here 400 

n-1200. 

We screened several M2 lots under different culture conditions since it was known that 
some SAR mutants are conditional (Dietrich et aL, 1994). Therefore, an average of 1500 
plants per lot or 250,000 plants in total, were screened for spontaneous expression of the PR- 
7/luciferase gene. This represents a coverage of the Ml gene pool in the M2 generation of 
P = 1 - (f) n (Redei and Koncz, 1 992) 

with 

P: probability to detect a recessive homozygous mutation in 
the M2 

n: number of M2 plants screened per Ml plant = 250000 / 
(168x50) = 29.7 

f: theoretical fraction of M2 plants that do not show a 
mutation present in one of the proposed effective two 
germ cells of a Ml plant =1-1/8 

P-l -0.875 29 - 7 = 98.1% 

Thus, the probability to identify a given Ml mutation in the M2 population approached 
saturation. 

The frequency of putative mutants did however not vary with the conditions, and was 
approximately 2.4 x 10' 3 . 1 60 M 1 pools contained at least one plant that constitutively 
expressed the Pi?-i/luciferase gene. 
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In total, 603 putative mutants were identified in this in vivo screen. Almost all of them 
were confirmed by the in vitro luciferase analysis in the M2 or their progeny. The phenotypes 
were therefore considered to be caused by genetic mutations. We expected a multigenic 
regulation of PR-1 gene expression and, thus, a high number of mutants. However, the 
number of mutants was iurther increased by the redundancy of mutant identifications in the 
M2, which provided several fold coverage of the Ml gene pool (M2 saturation of the Ml) and 
the fact that several independent mutations per gene (alleles) might have been identified (Ml 
saturation of the genome). To find out how many independent genes are actually involved in 
the SAR cascade, all 603 mutants would have to be mapped or crossed to each other 

Considering the frequency of PJ?-/-overexpressing plants in an M2 after EMS 
mutagenesis, T-DNA insertion mutagenesis might also yield mutants with constitutive PR* 
//luciferase expression, within a reasonable population size. Using T-DNA insertions to 
mutagenize the genome has the disadvantage of being labor-intensive and having a low 
mutation rate (1-2 inserts per genome, compared to over 80 point mutations per genome by 
EMS), but has the potential advantage of easy cloning of the mutated, "tagged' gene, by 
plasmid rescue or other techniques. Because of our finding from the EMS screen, that most of 
the cim mutants were dominant, or codominant (see below, section 3B), we screened the Tl 
population (primary, hemizygous transformants) for constitutive /^-//luciferase expression. 
10,000 Tl lines were screened for in vivo P/J-i/luciferase activity, and 80 of them were 
retested in the T2 generation. Strong luciferase activity was confirmed in 7 T2 populations 
(see section 4E, table 15). This low rate of confirmation in the T2 generation was caused by 
the low cut-off of luciferase activity that was used in the Tl as a criteria for selection in order 
to also find codominant mutants. Therefore, more false positive plants were retained than in 
the EMS screen. 

3 . Characterization Of cim Mutants 

A. Histological characterization of the cim mutants 

The goal of this study was to identify SAR activated mutants that do not show 
spontaneous cell death, according to the definition of cim mutants. In a secondary screen, the 
603 mutants were therefore subjected to a Trypan Blue lesion staining in the M2 and the M3 
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generation and examined both macroscopically and microscopically for cell death. More than 
90 mutants did not show macroscopic patches of cell death, but only 16 did not have any cell 
death under our growth conditions, as revealed by microscopy after staining. Most of the 
other mutants developed necrotic lesions at some stage of their life cycle, mostly in the leaf 
tips. 

One of the mutants with spontaneous cell death, designated mutant 779, was included in 
all the following experiments as a control. Mutant 779 displayed patches of autofluorescence 
and callose that normally accompany HR-like cell death. No callose was detected in the 16 
cim mutants. Although free of lesions, pleiotropic phenotypic alterations in the 16 cim 
mutants were not separated from the mutation that caused constitutive PR gene expression by 
three backcrosses. In general, cim mutants have a prolonged life cycle, a delayed flowering 
time (one to four weeks later than wild-type Col-0) and they set fewer seeds (approximately 
one third of Col-0). Some mutants also showed reduced germination. Leaf morphology varied 
from long, often curly leaves (cim2Q5, ciml\6\ to extremely small, round leaves (cimGll, 
cfmSlO). Mutant cirn611 showed a bright green leaf pigmentation, other cim mutants (cim7l3 f 
c/w810) had dark-green leaves. However, normal leaf morphology was also found, albeit 
mostly in the weaker mutants, cim^lS and c/w658 (weakness based on PR-1 gene expression 
and SA content, see below) as well as in the mutant cimlll that differed from wild-type only 
in size. 

B. Genetic characterization of the cim mutants 

All 16 mutants originated from different seed pools and were therefore considered 
independent mutations. All mutants were backcrossed at least three times to the PR- 
Muciferase parental line. 
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Table 6: Genetic characteristics of the 16 dm mutants identified in the P.R-7/luciferase 



EMS mutant screen. 





16 


Ml 


Luc in 


F2(+ 


:-)* 


x 2 (P) 5 


Map position 




cims 1 


pool 2 


Fl 3 










Mutants 


8 


36 


yes 


57 : 


25 


1.32 (P>0.2) 


chr. 2* 


described in 


205 


15 


yes 


40 : 


20 


2.22 (P>0.1) 


chr. 1 


more detail 


328 


2 


yes 


n.d 7 . 








658 


93 


yes 


58: 


16 


0.45 (P>0.5) 






677 


98 


no 


33 : 


99 


0 (P>0.95) 






695 


107 


yes 


31 : 


11 


0.03 (P>0.95) 


chr. 5 




713 


115 


yes 


43 : 


12 


0.30 (P>0.8) 


chr. 1 




716 


124 


yes 


34 : 


21 


5.10(P>0.01) 8 






[779] 


168 


yes 


47: 


60 


55.1 8 






810 


164 


yes 


33 : 


19 


3.69 (P>0.05) 






824 


155 


yes 


35 : 


11 


0.03 (P>0.95) 




Not further 


2 


17 


no 


n.d. 






characterized 


ll 6 


36 


yes 


87 : 


35 


0.89 (P>0.4) 




mutants 


81 


8 


yes 


n.d. 








367 


16 


yes 


n.d. 








671 


91 


yes 


28 : 


22 


9.63 (P>0.01)* 






714 


121 


yes 


15 


:5 


0 (P>0.95) 






741 


132 


yes 


22 : 


18 


8.53 (P>0.01) 8 





1 . identification number of cim mutant 

2. Ml lot of mutant origin 

3. luciferase activity in Fl of backcrosses to the 6E line at least five times above background; due to 
incomplete penetrance of the mutant phenotype, a varying percentage of plants in the Fl populations 
expressed the Pi?-7/luciferase gene 

4. segregation ratios of luciferase expressing to not-expressing plants in the F2 generation 

5. % 2 and probability of the observed difference to the expected 1 :3 segregation ratio 

6. ciml 1 originated from the same Ml seed lot as c//w8, they may be identical 

7. n.d.: not determined 

8. hypothesis of a 3:1 segregation rejected 

9. chr.: chromosome 



Selfed progeny of all mutants stably expressed PR-7/lucif erase. The presence of the 
mutations could normally be followed in all mutants by in vivo monitoring of luciferase 
activity in the Fl generation. 

To analyze the segregation ratios of the mutations, F2 populations of backcrosses, 
containing 20 to 100 plants were screened for constitutive luciferase activity and the resulting 
data were subjected to yl analysis (Table 6). The expression of the reporter gene in the Fl, 
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confirmed in random samples by Northern blot analysis for endogenous PR-1 expression, 
indicated that in all but two cases {ciml y cim677) the mutant phenotype was dominant. 
However, the analysis of the F2 segregation ratios suggested that many of these mutations 
were not fully penetrant. In addition, we can not exclude the possibility that in some cases 
(cimllS) two dominant genes are required to cause the observed phenotype (%2 for a 9:7 
segregation ratio = 0.69, P<0.4). In the case of cwz713, the morphological changes were 
inherited in a recessive manner, while the closely linked constitutive Pif-i/luciferase 
expression was codominant, with varying expression of the phenotype in the heterozygous 
plants. In cases where F2 segregation ratios were normal, we mapped the mutations to the 
responsible loci. Populations of usually 50 to 80 F2 plants (mutant crossed to ecotype Ler) 
with preselected phenotype were used to look for linkage between the cim phenotype and 
genetic markers. About 30 SSLP and CAPS markers that were evenly distributed throughout 
the genome, were chosen to find a primary linkage. If available, closer markers were chosen 
to further define this map position. ciml\3 was placed on the genetic map of Arabidopsis 
thaliana on chromosome 1 between markers mi291a and markers nga280 (see below, section 
4B). cimlQS is also located on chromosome 1, between markers nga280 (20 recombinants in 
116 analyzes meiosis) and ml85 (19 recombinants in 148 meiosis). cimS is located on 
chromosome 2, between markers veOl 7(16 recombinants in 148 meiosis) and ngal 68 (9 
recombinant in 122 meiosis). cim695 lies on chromosome 5 between markers DFR (22 
recombinants in 106 meiosis and LFY (17 recombinant in 1 10 meiosis). The map positions of 
the mutations on chromosome 1 and 2 do not match the map position of known mutations in 
genes encoding functions in disease resistance and/or SAR. Mutant cim695 is in a region of 
chromosome 5 termed MRC-J, which contains a number of R gene homologs (Botella et al, 
1997; Holub and Beynon, 1997). cim205 and ciml\7> map close to but distinct from cpr6 
(Clarke et aL, 1998). 

C. Biochemical characterization of the cim mutants 
I. cim mutants accumulate salicylic acid 

One characteristic of SAR is the accumulation of SA, which is not only sufficient to 
induce PR gene expression and resistance, but also required, as shown with the SA-degrading 
NahG lines. Free and total (= free and conjugated) SA content was measured in 90 mutants. 
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Although we expected to find some mutants downstream of SA, for example possible gain-of- 
function nimllnprl alleles, all mutants (with the exception of cim65$) accumulated 3- to 15- 
fold more SA than untreated wild-type plants (Table 7). A control treatment with a virulent 
Erysiphe cichoracearum pathogen caused a 7-fold increase in total SA content after three days 
of infection. 



Table 7: Salicylic acid content is increased in most cim mutants. Total salicylic acid 
content was determined by HPLC separation of an organic tissue extraction. Results (in ng 
S A per mg tissue fresh weight) are mean values and standard deviations of three independent 
measurements. As a comparison, SA content was measured in wild-type tissue infected with 



Erysiphe cichoracearum, harvested 3 days after inoculation. 



mutant 


total SA (ns/mg fresh weight) 


wild-type 


296+/- 25 


wild-type + Erysiphe 


2030 +/- 890 


cim% 


1958 +/- 835 


cim205 


1657+/- 436 


dw328 


899+/- 16 


cim65$ 


294+/- 21 


cim617 


4154+/- 211 


cim695 


2256 +/- 223 


ciml\3 


1500+/- 78 


ciml\6 


1350+/- 267 


cimBlO 


3415 +/- 331 


cim&24 


2190+/- 491 



On average, among the 90 mutants for which S A content was measured, lesion mimics 
have higher levels of SA than cim mutants, but the two distributions are overlapping: cim 
mutants can have more SA than lesion mimic mutants, cim mutants are hence not simply a 
weak subclass of lesion mimic mutants, but ought to be considered as a distinct class of 
mutants. Free S A content was about tenfold less than total SA and was always correlated to 
the total SA content thus excluding from our collection mutations in the regulation of this 

expression, see below), mutants can be classified into strong cim mutants (e.g. cim$ 9 cim205, 
cim677 9 c/m695, c/m810, cimS24) and weak cim mutants (e.g. cz/w328, Ci'w658). 
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II. Other phenylpropanoids in cim mutants 

To determine if the elevated SA levels in the cim mutants were the result of altered 
regulation at a specific step in the S A biosynthesis, attempts were made to measure levels of 
SA precursor molecules. We were however not able to detect *-CA or benzoic acid in plant 
extracts, because of difficulties in detection. Unlike SA, these compounds emit no detectable 
fluorescence (kmux at 230 nm). 

In a cooperation with Pro£ Dr. Strack (Halle, Germany), the phenylpropanoid content in 
the ten cim mutants was analyzed in more detail, using two different organic extractions (80% 
methanol, or dichloromethane) and a basic cell wall extraction procedure, followed by a 
separation on a C-l 8 reverse phase HPLC. With minor, unidentified exceptions, no variations 
were detected in the UV spectra of the mutants compared to wild-type. 

IEL cim mutants can accumulate low levels of camalexin 
Plants under pathogen attack accumulate antimicrobial molecules, called phytoalexins. 
The major phytoalexin in Arabidopsis is camalexin, derived from a tryptophan precursor. 
Camalexin can be visualized under UV light after TLC separation of methanol plant extracts. 
Both a synthetic standard and tissue harvested after Pseudomonas infection were used as 
controls. The Rf for synthetic camalexin was a bit lower (0.77) than the R f for camalexin in 
complex mixtures (0.78). Mutants c««328, cim617, ciml\6, 779 and cimMO accumulated 
between 1 \xg camalexin/cm 2 (based on the synthetic camalexin standard) and the amount that 
accumulated in Pseudomonas-mfecled tissue after 3 days (literature value: 1 .2 fig 
camalexin/cm 2 ; Glazebrook and Ausubel, 1994). Mutant cimS24 accumulated less than 1 
jxg/cm 2 , and cimS, cwm658, cim695 and cimlYS did not show substantial amounts of 
camalexin. A quantification of the camalexin by HPLC of samples isolated from TLC plates 
was not successful, possibly because the amount of camalexin was too small for the available 
detection system. 
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D. cim mutants exhibit resistance to pathogenic microorganisms 
In order to show that S AR (or LAR) is constitutively activated in cim mutants, 
resistance to SAR pathogens must be shown. 

I Most cim mutants are resistant to Peronospora parasitica 
The response of the mutants to two isolates of the oomycete parasite Peronospora 
parasitica which are virulent on wild-type A. thaliana Col-0 (Holub et al., 1994) was tested. 
Infection was scored in adult leaves eight days after inoculation with P. parasitica Nocol. 
While Col-0 wild-type plants were covered with hyphae and displayed cell death along the 
fungal structures, no fungal structures, chlorosis or spontaneous macroscopic necrosis was 
observed in the "strong" cim mutants with high levels of PR-1 gene expression and SA eight 
days after infection. Only the two "weak" mutants d/«328 and cim65S allowed fungal hyphal 
growth and slight sporulation. Trypan Blue staining for hyphal growth and cell death 
revealed, however, that in some mutants trailing necrosis or HR like pits (Holub et al. y 1994) 
occurred around the hyphal penetration sites (e.g. cim695, ciml\6). This phenomenon was 
not correlated to SA content or PR gene expression. A second compatible P. parasitica 
Emco5 (Holub and Beynon, 1997; McDowell et ah, 1998) was applied to younger plants in a 
cotyledon assay, because infection of wild-type plants is stronger at this earlier stage. 
Resistance to P. parasitica Emco5 was similar to that observed with the isolate Noco2. The 
results suggest that the observed resistance is not age-dependent or an isolate-specific 
reaction. 

IL A subset of cim mutants is resistant to Erysiphe cichoracearum 
To determine if the disease resistance seen in the cim mutants was specific to 
Peronospora or more broad-spectrum, the mutants where challenged with a second fungal 
pathogen. Erysiphe cichoracearum is virulent on most A thaliana ecotypes, including Col-0 
(Adam and Somerville, 1996). To have a negative control for pathogen infection, it was first 
shown, that Col-0 can be completely protected from Erysiphe infection by BTH treatment (0.3 
rnM). A disease rating system between 1 (resistant) and 3 (susceptible) was used to quantify 
macroscopic symptoms. Interestingly, this assay revealed a differential response among the 
cim mutants. Some cim mutants (e.g. c/m328, c/m810) are completely resistant to E. 
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cichoracearunty and others completely susceptible. The resistance did not correlate with the 
strength of PR~1 gene expression or SA content. The two strongest mutants cim&ll and 
cim&lO were resistant, but c/w328, with low PR-1 gene expression and SA accumulation, also 
displayed an almost complete resistance (disease rating 1.01). 

ILL Some cim mutants exhibit a good resistance to virulent Pseudomonas syringae 
pathovars 

To check the cim mutants for resistance to bacterial pathogens, they were inoculated 
with several different virulent Pseudomonas syringae strains. Interpretation of the results of 
these experiments was often hampered by the non-wild-type morphology and developmental 
stage of the cim mutants. It became clear, however, that resistance to Pseudomonas spp. was 
in many mutants not as clear-cut as resistance to Peronospora and Erysiphe. Differences in 
resistance to the aggressive pathogen Pseudomonas syringae pv, syringae DC3000 were small 
among the mutants. Therefore the less virulent strain P. syringae pv. maculicola ES4326 was 
chosen to better illustrate the spectrum of resistance to P. syringae among these mutants. 
Mutant cim611 9 cim695 9 cim7l3, and czm810 exhibited a bacterial proliferation reduced more 
than 10-fold compared to wild-type (Table 8). For mutants cim205 and ciml\6, the bacterial 
titer five days after inoculation was 2-fold lower than in wild-type. While mutants cimS and 
c/m824 are both in the class of "strong" mutants, they were at least as susceptible to this P. 
syringae isolate as wild-type (Table 8). 

The response to avirulent Pseudomonas strains also differed among the cim mutants. 
Several cim mutants (cim20S 9 cwi328, ciml\6, cim824) displayed an HR identical in timing 
and appearance to wild-type. Other cim mutants (cim7l3, cimS\0) did not show the typical 
HR symptoms or HR was delayed. In cim&ll \ the HR occurred 2 to 3 hours earlier than in 
wild-type, in which the HR was visible 8 to 9 hours after inoculation. This mutant showed 
increased sensitivty to virulent Pseudomonas strains, although the injection of MgCb did not 
have any visible effect on the plant. 
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Table 8: Resistance of cim mutants to Pseudomonas syringae pv. ES4326. Bacterial 
colony forming units (cfu) were counted at three and five days after infiltration from four 
independent experiments, each containing four leaf punches per mutant per time point. While 
in most cims bacterial growth is significantly limited, in some mutants (c/rnS, cim824) 
proliferation is not reduced. 6E: wild-type, B: wild-type pretreated with 0*3 M BTH 2 days 



prior to Pseudomonas infection. 



mutant 


mean cfu, 3dpi 2 


i value (P), 3 dpi 3 


mean cfu, 5 dpi 2 


t value (T), 5 dpi 3 


6E 


965000 




2140000 




B 


36500 


5.82 (P>0.995) 


38000 


1.74 (PXX9) 


8 


1752500 


0.53 (P>0.65) 


12550000 


3.37 (P>0.99) 


205 


333500 


2.53 (P>0.975) 


995000 


0.60 (P>0.7) 


328 1 


151500 


0.94 (P>0.8) 


68000 


3.23 (P>0.99) 


677 


105000 


5.40 (P>0.995) 


35500 


1.74 (P>0.9) 


695 


467500 


1.33 (P>0.85) 


69000 


1.71 (P>0.9) 


713 1 


25650 


1.74(P>0.9) 


14200 


4.66 (P>0.995) 


716 


82500 


5.47 (P>0.995) 


840000 


1.04 (P>0.8) 


810 


618666 


L13(P>0.85) 


273333 


1.50 (P>0.9) 


824 


730000 


0.56 (P>0.7) 


3350000 


0.58 (P>0.7) 



1. Mutant cim328 and cim713 were tested in a separate experiment, the cfu values for 6E wild-type 
were at 3 dpi: 327000 and at 5 dpi: 362500 

2. Bacterial viable count, expressed as cfu (colony forming units per 4 leaf discs), calculated from 
four independent repetitions, dpi: days after inoculation 

3. Differences of mean values of 6E wild-type and mutants, expressed as / values 



E. Molecular characterization of the cim mutants 
I. Northern blot analysis 

To further characterize the mutants at molecular level, gene expression for known 
marker genes of various pathways was analyzed. The results may provide insight into the 
signaling pathways that are turned on in cim mutants and allow a further understanding of the 
maintenance phase of SAR. 

Genes implicated in oxidative burst and cell death 

Expression of genes implicated in regulating the cellular redox state or the oxidative 
burst was monitored in the cim mutants. Superoxides and/or hydrogen peroxides are thought 
to determine the outcome of plant-pathogen interactions by several means: z\) they can 
reinforce the cell wall, ii) they can act as messengers, and iff.) they could be involved in direct 
killing of the pathogen or the host cell during the HR (Low and Merida, 1996). Sources of the 
reactive oxygen species (ROS) can be cell- wall bound NADPH oxidases, peroxidases and 
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lipoxygenases. ROS are degraded in the plant cell by superoxide dismutases (SOD), and 
successively in the Halliwell-Asada (Ascorbate-Glutathione) cycle, reducing H2O2 to H 2 0 
under use of NADPH (Asada, 1994; Halliwell and Gutteridge, 1989). Three families of SOD 
are known, classified by their cofactor (Bowler et al , 1 992). It has been previously shown 
that Cu/ZnSOD gene expression is suppressed during pathogen infection and oxidative stress 
(Fodor et al, 1997). In several cim mutants (c/m716, cimSlO, cim658), Cu/ZnSOD gene 
expression is suppressed. As an alternative to reduction of H2O2, catalases may dismutate 
cellular H 2 0 2 . In Arabidopsis, three genes encode for catalases, that are 70 to 72% identical at 
the nucleotide level (Frugoli et al, 1996). The specific functions of catalase isozymes are still 
not well understood. The expression of the catalase 2 and 3 genes is not significantly altered 
in the cim mutants, or by pathogen treatment. Peroxidases also degrade H2O2 by oxidizing 
specific substrates. They are responsible for lipid peroxidation, and cell wall cross-linking. In 
Arabidopsis, little is known about the cellular function and regulation of the two peroxidases 
used as probes in Northern blot analysis, cationic peroxidase and peroxidase C (Jabs et al, 
1996). The genes of both peroxidases are induced in the lesion mimic mutant 779, and to 
some extent by BTH and Peronospora, as well as in cim%24 and cim8 10. Peroxidase C is 
also weakly induced in the other mutants. 

Although one of the selection criteria for the cim mutants was absence of cell death, 
expression of several genes known to cause lesion mimic phenotypes when mutated were also 
checked for changes in expression. The barley MLO gene, whose function is unknown, has 
several homologs in Arabidopsis (Buschges et al, 1997). One of the ests (205N12T7) was 
used to quantify the expression of the MLO genes in the cim mutants. The degree of 
homology between the different Arabidospsis MLO homologs, and their total number in the 
genome is not known, it is therefore likely that the signal on the Northern (RNA) blot reflects 
the expression of several genes and hides potential differential gene expression. No 
significant changes in gene expression were detected. The same is true for the Arabidopsis 
lesion mimic gene, hdl (Dietrich et al, 1997) and for the Arabidopsis homolog of the maize 
lethal leaf spot gene (llsl, est 84E8T7) (Gray et al, 1997). A gene putatively implicated in 
the regulation of cell death, DAD (defense against death; Sugimoto et al, 1995) was slightly 
induced in all mutants compared to wild-type. Its two-hybrid-interactor, MOM did not show 
major differential gene expression among the cim mutants. 
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Genes implicated in pathogen defense responses 

While the expression of genes implicated in the regulation of cell death for the most part 
did not change significantly in the aim mutants, genes whose expression correlates with 
pathogen infection and disease resistance, were induced in cim mutants* The intensity of 
expression of PR- J, whose promoter was used to identify the mutants, correlated with the 
level of luciferase activity in all mutants. In mutants with weaker luciferase activity (c/w328 
and c/m658), PR-1 gene induction was weaker, but still stronger than in wild-type. The 
strongest mutants in terms of both luciferase activity, and disease resistance accumulated the 
highest amounts of PR-1 messengerRNA. cim611, cim$24 and the lesion mimic 779 all 
showed over 40-fold induction of PR-1 gene expression, compared to wild-type. PR- J gene 
expression is stronger in these mutants than that obtained by BTH or pathogen induction of 
SAR in wild-type plants. This finding eliminates the possibility that the luciferase activity 
used to select the mutants was the result of a mutation in the promoter of the transgene, since 
all mutants also expressed the endogenous PR-1 gene to high levels. Expression of the two 
other Arabidopsis SAR genes (Uknes et aL, 1992), PR-2 (data not shown) and PR~5 was also 
elevated in most of the cim mutants, though not to the same degree as PR~L It has been 
previously shown that the regulation of these genes does not always correlate with PR-1 gene 
expression (Reuber et aL, 1998), The PR~5 gene expression was, however, induced in all 
mutants, with the exception of the weakest mutant, cim65S. 

Thionins and defensins are genes that are induced by pathogen attack. They are not 
induced during SAR but are regulated by an S A-independent, JA-dependent signaling 
pathway (Epple et aL, 1998; Penninckx et aL, 1996). Both an antagonistic interaction and a 
concomitant induction of PDFL2 and PR-1 expression have been described in the literature 
(see Maleck and Dietrich, 1999 for review). Interestingly, thionin2J was induced in some 
cim mutants (c/w658, cim671 y ciml\6 and mutant 779). PDFL2 is known to be induced in 
several lesion mimics, and was also induced in mutant 799. It was induced in cim695> cim611 
and cim824, but is suppressed in other cim mutants (cim%10, cim205, and by BTH). Notably, 
in some cim mutants, the expression level of PDFL2 did not correlate with the resistance to 
Erysiphe dehor acearum, as reported previously for some cpr mutants (Bowling et aL , 1997): 
For example, the lesion mimic mutant 779 displayed strong PDF 1.2 gene expression but was 
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susceptible to the fungal pathogen, while cwi328 showed PDFL2 gene expression at 
uninduced wild-type levels, but almost complete resistance. 

Some members of the lipid transfer proteins {LTP) gene family have similar induction 
kinetics as defensins and thionins, and are thought to play a similar role in defense (Molina 
and Garcia-Olmedo, 1997). In a Northern blot, LTP appears to be constitutively expressed, 
probably due to the cross hybridization of the probe with multiple LTP transcripts of the gene 
family. 

Two genes that are involved in the signaling leading to systemic disease resistance have 
been identified by mutant screens and positional cloning. The NIM1 gene is modestly SA- 
inducible (Cao et al, 1997; Ryals et al, 1997) and is induced in the cim mutants with the 
highest SA accumulation {cimlQS, cim611, cwi695, cim&lO, cimS24). The NDR1 gene that is 
induced in incompatible plant-pathogen interactions reflects these weak changes. 
Interestingly, the previously described induction of the NDR1 gene by pathogens (Century et 
ah, 1997) was not observed in the pathogen-infected control (tissue harvested eight days after 
inoculation with P. parasitica Noco2). This finding is consistent with the observation that 
NDR1 function is not required for resistance to Noco2 (Century et al, 1997). 
Genes implicated in secondary metabolism 

The expression of the gene of a key enzyme in the shikimate pathway, EPSP synthase 
(Gorlach et al, 1995) was not induced in any of the mutants (data not shown). However, PAL 
genes, which encode the putative rate-limiting enzyme in the general phenylpropanoid 
pathway leading to SA biosynthesis (Bate et al, 1994) and which are inducible by many biotic 
and abiotic factors (Wanner et al., 1995) were induced in some mutants (e.g. in cimlYl, 
ciml\S) and by BTH treatment, but not in others (cim695, mutant 779). We used the PALI 
gene clone as a probe, which is very similar to the PAL2 gene in Arabidopsis (the amino acid 
sequences are 90% identical), but less similar to the PAL3 gene, which is usually expressed at 
very low levels (Wanner et al, 1995). The induction of PAL gene transcription did not 
correlate with the S A content or disease resistance in these mutants. Considering the 
accumulation of SA in the mutants, and the primary regulation of the phenylpropanoid 
pathway on the transcriptional level, it is remarkable that not all mutants showed a similar 
pattern of induced gene expression for key enzymes in this pathway. Preliminary results 
showed that PAL activity in all cim mutants (and a pathogen-infected control) were 1 .2 to 1 .4 
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fold higher than in wild-type (except for cim65S) 7 suggesting a post-transcriptional regulation 
of PAL activity. 

The expression of the chalcone synthase gene (CHS; Shirley et ah, 1995) paralleled the 
induction of PAL gene expression in all mutants. To our knowledge, in Arabidopsis, the 
induction of chalcone synthase gene expression during SAR has never been studied. 

Concomitant with the finding that none of the mutants accumulated high levels of 
camalexin, phosphoribosyl-anthranilate synthase (PAT1) gene expression was only weakly 
induced in the cim mutants, PAT1 gene expression is tightly correlated to gene expression of 
the other enzymes in the indole biosynthesis pathway and induced during bacterial pathogen 
infection (Zhao and Last, 1996). We did not find a strict correlation between accumulation of 
PAT1 transcript and camalexin accumulation, although cimffll, cz/«810, and cim&24 have all 
higher levels of camalexin and PAT1 gene expression. 
Hormone-inducible marker genes 

The expression of some plant hormone-inducible genes was monitored in cim mutants. 
The Arabidopsis vegetative storage protein acid phosphatase gene (AtVSP), a marker for 
jasmonic acid induced gene expression (Berger et al, 1995) was either weakly or not induced. 

PR-4 expression is ethylene-inducible. Its expression in the cim mutants was about 20- 
fold weaker than that observed in an ethylene treated control plant and not induced above 
wild-type level. 

A possible exception may be the Rabl8 gene, an example of an ABA-inducible gene 
(Merlot and Giraudat, 1997). Rabl8 gene expression was induced in c/m328, cim695 and 
mutant 779. 

In conclusion, the induction of SAR in cim mutants most likely does not induce or 
depend on other hormonally regulated pathways as monitored by marker gene expression, 

H. Gene expression profiling using a DNA microarray 
The gene expression patterns obtained by Northern blot analysis in the different cim 
mutants seemed to be similar to each other, and the observed qualitative differences in 
pathogen resistance are not easily explainable using these data. To get a more complete 
description of the transcriptional changes during SAR and in cim mutants, a new technology, 
the DNA microarray, was tested. Because SAR and the cim mutants have been quite well 
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studied in terms of gene expression, they provided a good system to establish this technology 
(technology verification). At the same time, new marker genes, or gene profiles, might help to 
unravel the different regulatory mechanisms involved in plant disease resistance (knowledge 
discovery). 

Contrasting other genome-wide gene expression monitoring techniques, such as 
differential display or cDNA AFLP (Bachem et al, 1996; Diatchenko et aU 1996), the DNA 
microarray technique can show expression patterns only for cataloged genes (although not 
necessarily with known function). For Arabidopsis, an extensive expressed sequence tag (est) 
collection is available. We used part of the Michigan State University (MSU) non-redundant 
est collection, containing 13,000 clones in 135 96-well plates, as "elements" on the DNA 
microarray 

On each microarray, a competitive hybridization with two differentially labeled probes 
was carried out and a signal (per fluorochrome and per element) was integrated and the local 
background subtracted. The samples prepared for hybridization on the DNA microarray are 
listed in Table 9. 



Table 9: Sample pairs used in competitive hybridization experiments on the DNA microarray 



Experiment 


cy3 probe 


cy5 probe 


1 


cim205 


wild-type 


2 


cim32S 


wild-type 


3 


ciml 1 3 


wild-type 


4 


wild-type 


ciml 13 


5 


wild-type + BTH 1 


wild-type 


6 


NahG 


wild-type 


7 


cimlQS x NahG 


wild-type 


8 


c//n713xNahG 


wild-type 



L 0.3 mM BTH were sprayed 2 days prior to harvest 



Technology evaluation 
We used several controls to evaluate the new DNA microarray technology for its use in 
plant biology. 

(i) element-to-element reproducibility 
In order to spot the ests on a glass slide, the plasmid inserts were amplified from liquid 
cultures using a modified PCR primer (5' amino linker). About 90% of the inserts gave 
sufficient PCR products, but the yield could not be improved by isolating the plasmids prior to 
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PCR, In addition to the est collection, one plate with known and well characterized genes was 
prepared for spotting. The clones comprised both constitutive genes such as the biotin 
synthase and protoporphyrin oxidase genes, and pathogen- or stress-induced genes (PR genes, 
oxidative stress genes). To evaluate the element-to-element reproducibility in the same 
hybridization, all cDNAs of the control plate were spotted three times on the DNA 
microarray. Table 10 shows the values for 11 genes whose expression was upregulated in the 
three dm mutants compared to wild-type. The standard deviations did not exceed 30% of the 
mean values and were in many cases much better. Inconsistency in spotting DNAs onto the 
slide is a more common source of variability than differences in the actual hybridization 
signal For example in the hybridization experiment 2 (cim32S versus wild-type), the spotting 
failed twice for the CHS gene. No signal was read from two of the three spots. Spotting is 
still the most delicate process in the microarray fabrication and may fail in a certain 
percentage of arrays. The precise fail rate of spotting is difficult to estimate because PCR 
errors also add to the rate of signal failure (seen in Table 10, for the PAL and the LOX gene), 
but both together do not exceed 15%. In conclusion, element-to-element variation is at an 
acceptable leveL 

Table 10: Element-to-element variability on the DNA microarray. cDNA clones were 
spotted three times on the array and expression values were compared in three independent 
hybridization experiments, using as probes UNA from cimlW, cimlQS and c/w328, and wild- 
type as comparison. Raw data (expression) and mean and standard deviation (mean+/-std) for 



1 1 genes with significant inductions are shown. 



Gene 


cim20S 




c/*w713 




expression 


mean+/-std 


expression 


mean+/-std 


expression 


mean+Astd 


PR-1 


10.8/9.1/6.2 


8.7+/-1.9 


10.0/7.5/7.4 


8.3+A1.2 


9.0/6.6/2.8 


6.1+/-2.6 


PR-5 


3.6/3.3/3.1 


3.3+/-0.2 


4.1/3.8/3.8 


3.9+A0.1 


6.2/5.9/6.2 


6.1+A0.1 


PR-2 


4.7/4.7/2.1 


3.8+/-1.2 


3.4/2.0/3.0 


2.84-/-0.6 


6.2/6.0/3.3 


5.2+A1.3 


PAL 1 


3.5/-/- 


3.5 


3.4/-/- 


3.4 


5.2/-/- 


5.2 


CuZnSOD 


1.6/1.4/1.2 


1.4+/-0.2 


2.2/2.1/2.1 


2.1+AO.O 


4.3/2.4/3.4 


3.4+A0.8 


LOX 2 


1.6/2,1/- 


1.9+/-0.3 


1.9/1.7/- 


L8+A0.1 


4.2/3. 6A 


3.9+A0.3 


CHS 


1.5/1.2/3.7 


2.1+/-U 


3.6AA 5 


3.6 


3.6/1.2/1.0 


1.9+A1.2 


PRXC 


2.8/2.6/2.1 


2.5+A0.3 


2.1/1.6/1.7 


1.8+A0.2 


3.4/2.6/2.3 


2.8-f-/-0.5 


NIM1 


3.5/2.7/2.2 


2.8+A0.5 


2.7/2.4/1.5 


2.2+A0.5 


3.2/2.6/2.2 


2.7+A0.4 


GLUPER 4 


2.5/2.5/2.5/ 


2.4+A0.1 


2.4/2.0/2,0/ 


2.0+A0.2 


2.2/1.9/2.0/ 


2.0+A0.1 




2.4/2.4/2.2 




1.9/1.7/1.7 




2.1/2.1/1.8 




APX 


1.7/1.7/1.6 


1.7+/-0.0 


1.4/1.3/1.3 


1.3+AO.O 


1.8/1.7/1.7 


L7+/-0.O 
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1 Only one element gave a signal in all three hybridizations, possibly due to a failed PCR reaction 

2 One element did not give a signal in any hybridization 

3 Two elements did not give a signal in this particular hybridization, thus revealing a spotting error 

4 GLUPER was spotted 6 times on the DNA microarray 

(ii) experiment-to-experiment variability 
For each experiment, a pair of probes was prepared. Single strand cDNA was labeled 
during synthesis with the fluorochromes cyanine cy3 (red) or cy5 (green). These dyes differ 
only in one double bond and it is likely that they are incorporated with identical efficiency in 
the first strand cDNA synthesis. The cyanines have very high, but similar molar absorptivity 
(s > 50000 cm^M" 1 ) and large fluorescence enhancements upon binding to nucleic acids, Cy3 
and cy5 have distinct and narrow emission peaks at 532 run and 633 nm, respectively. To 
estimate the variation between two experiments, and to estimate the influence of the dyes, we 
labeled the same pair of RNA twice but reciprocally, i.e. in the first hybridization, we 
compared the RNA of ciml\3 (cy3) to the RNA of wild-type (cy5), in the second 
hybridization, we compared wild-type (cy3) to ciml 13 (cy5; experiment 3 and 4 in Table 9). 
The difference between the value pairs ranges in general between 10% and 30%. These 
changes do not exceed the element-to-element variability. The linear regression coefficient 
for the 10 data pairs calculates to r = 0.83. The data points from the two experiments can thus 
be correlated linearly and different hybridization experiments may be compared to each other. 

(Hi) Comparison of DNA microarray and Northern blot analysis of gene 
expression 

A third validation is a direct comparison between differential gene expression quantified 
by the DNA microarray technology and by Northern blot analysis (Table 1 1). With few 
exceptions, such as the PR-1 gene expression in cim205 and ci>w713, the DNA microarray 
gives higher absolute values of gene induction than obtained in Northern blot analysis. This is 
not surprising because the values are normalized on a scale from +100 to -100 which appears 
to extend the scale (compare section 5B, where the same phenomenon was observed). 
Apparently, the yeast RNA that was spiked for normalization purposes in the plant samples 
did not match exactly the abundance of plant mRNA in the cell. Induced expression levels 
between 3-fold and 6-fold are reported by both technologies to similar levels (e.g. PR-5, PAL). 
While the same trend is detected for inductions of gene expression by both technologies, in 
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the case of repressions, this is not always true (but compare section 5B). We found three 
elements whose sequences encoded senescence associated proteins (Table 11; ests 246D2T7, 
246B12T7, and 212B17T7). They were more than 10-fold suppressed in cimlOS based on 
rnicroarray data. 

Table 11: Comparison between gene expression quantification by the DNA rnicroarray 



and by Northern (RNA) blot analysis. 



Est/clone 


cimlOS 


cim32S 


cwt713 


Homology 




chip 


Northern 


chip 


Northern 


chip 


Northern 




PR] 


8.7 1 


18.9 


8.3 


3.0 


6.1 


12.7 




PR5 


3.3 


3.2 


3.9 


1.7 


6.1 


1.15 




PR2 


3.8 


2.4 


2.8 


1.1 


5.2 


2.0 




JrDrJ.J 


1.4 


0.8 


1.4 


0.3 


1.6 


0.5 




PAL 


3.5 


2.5 


3.4 


3.4 


5.2 


4.6 




NIM1 


2.8 


1.7 


2.3 


0.9 


2.6 


1.5 




CuZnSOD 


1.4 


1.5 


2.1 


1.8 


3.3 


1.0 




CAT2 


1.2 


1.0 


1.1 


2.1 


1.4 


1.5 




CAT3 


1.3 


3.3 


1.2 


2.8 


1.5 


2.7 




118N4T7 


5.7 


3.2 


4.0 


2.0 


7.7 


3.9 


no homology 
XET 4 


92I21T7 


0.2 


0.7 


0.5 


0.9 


0.2 


1.5 


165P23T7 


1.5 


0.4 


2.4 


0.8 


3.4 


0.3 


Brassinosteroid 
regul. Protein 


129F12T7 


3.5 


1.5 


3.5 


1.0 


5.5 


1.5 


squalene epoxidase 


203C22T7 


5.0 


2.9 


4.2 


1.6 


9.3 


2.3 


no homology 


103L19T7 


4.7 


0.9 


3.3 


0.5 


5.1 


0.4 


unknown 


156F15T7 


1.9 


2.9 


2.6 


1.7 


5.9 


1.0 


cp nucl. DNA bind. 
Protein 2 


246D2T7 


0.1 


2.0 


0.6 


0.9 


0.9 


3.0 


Senesc ass prot 3 


246B12T7 


0.1 


0.5 


0.5 


1.1 


1.1 


2.5 


Senesc ass prot 3 


212B17T7 


0.1 


3.0 


0.5 


0.9 


1.0 




Senesc ass prot 3 


E12G2T7 


0.2 


0.8 


0.4 


1.3 


0.2 


0.43 


XET 4 


174N16T7 


2.3 


(1.2) 5 


2.6 


(0.8) 


5.9 


(2.1) 


LTP3 


122B2T7 


3.1 


2.4 


3.2 


1.5 


3.9 


2.1 


Chaperon 60 p 



L For genes on the control plate, average of three values are indicated except for CHS> where two 
spottings failed 

2. chloroplast nucleoid DNA binding protein 

3. Senescence associated protein 

4. Xyloglucan endotransglucosylase related protein 

5. Northern blot quantifications were done with LTP1 as a probe, which is very similar to LTP3 

This suppression was not confirmed to this extent by Northern blot analysis with any of 
the three ests. However, for ests 9212 1T7 and E12G2T7 (both Xyloglucan 
endotransglucosidase related), the reduced gene expression was confirmed by both 
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technologies. Actually, in Northern blot analysis for this gene it was found that not only the 
quantity of this mRNA is reduced in cim mutants but also the length of the mRNA, possibly 
through alternative splicing. In general, it can be concluded that changes in gene expression 
less than 2-fold in microarray experiments are not significant without supplementary 
confirmation, but that stronger fold-inductions are in good accordance to data obtained by 
Northern blot analysis. The linear correlation coefficient between data obtained by Northern 
blot analysis and by DNA microarray, calculated from 40 data pairs, results in r = 0.84, which 
expresses a satisfactory linearity. 

Analysis of the expression profiles of cim mutants 
Besides one control plate and two Synteni control plates for normalization, 102 plates, 
each containg 96 est fragments, were spotted on the DNA microarray. Ideally, this could 
represent 9792 different genes. However, resequencing and BLAST comparisons revealed 
some redundancies in the est population, so that we estimate the actual number of monitored 
genes to about 5000. In cases were sequencing did not match the previously indicated identity 
of the clone, the name of the est was changed. In the hybridization experiments with the three 
cim mutants, 7815 (cimlQS), 8069 (c//n713), and 7186 (czm328) elements gave readable 
signals. In six experiments listed in Table 9 (excluding experiment 4), 1562 different 
elements displayed at least in one hybridization a significantly altered (two-fold) change in 
expression, 1061 of those only in one experiment, 14 in all seven. 478 different elements 
displayed altered gene expression in at least one of the three cim mutants. As shown in table 
12, in mutant cim7\3 many more genes than in the other mutants are differentially regulated 
(499 elements in cimlYh versus 275 elements in cim205 and 180 elements in cim32$ changed 
more than 2-fold). In NahG plants, the expression of 310 genes was changed and BTH 
application altered the expression of 916 genes more than 2-fold compared to untreated wild- 
type. Increasing the cut-off from a 2-fold difference to a 2.5 fold difference in gene 
expression reduces these numbers to less than half, and only few elements display a more than 
3-fold change (tablel2). It should be noted, that for low-intensity read-outs (<1000 
fluorescent signal intensity (FSI)) the correlation curve is biased towards cy5 and for results 
falling into these intensity regions, a 2-fold, or even 3-fold induction is not necessarily 
meaningful. We chose therefore a 2.5 fold cut-off for further analysis (see below). For 
interesting elements with low FSI, a careful analysis will be required. The distribution of the 
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FSI shows that most mRNAs fall into the class of low- to medium-abundant mRNAs (1 - 50 
mRNA per cell; 1000 - 10,000 FSI). Only a few have intensities above 11,000 FSI, which 
corresponds to highly abundant transcripts (100 - 500 transcripts per cell, based on an 
estimated total number of 100,000 transcripts per cell; Kamalay and Goldberg, 1980). The 
largest changes in abundance were observed for low copy mRNAs that are usually more than 
10-fold induced. The highly expressed housekeeping genes do not change the transcription 
rate in the cim mutants. 

Table 12: Counts of elements displaying altered gene expression in three cim mutants, 
in plants treated with BTH and NahG plants. The total number of elements giving valid 
signals is shown, along with the number of elements with at least a 2-fold, 2.5-fold, 3-fold, or 
4-fold difference in signal in the mutant (or treatment) compared to untreated wild-type. The 
five elements in ci*w328 that displayed a more than 4 fold change in expression are three 



elements for PR-1, one element for PR-5 and est 203C22T7 (extensin). 



Number genes induced 


cim205 


cim32S 


ciVw713 


BTH 


NahG 


2-fold 


275 


180 


499 


916 


310 


2.5-fold 


86 


47 


153 


400 


165 


3-fold 


45 


19 


79 


261 


120 


4-fold 


25 


5 


29 


125 


71 


Total valid signals 


7815 


7186 


8069 


7816 


7122 



Because the majority of genes on the microarray stayed unchanged (under given 
experimental settings), we chose to limit the comparative analysis between the three mutants 
to genes that show altered transcription. 

To compare the three mutants to each other, gene groups were created: Genes that 
changed significantly in one mutant (more than 2.5 fold) were grouped together and their gene 
expression data in the other mutants were obtained and plotted in profiles. The expression of 
86 genes with differential expression in mutant cim205 was compared to their expression in 
cim7l3, cim32S and in the BTH control experiment. A characteristic of cimlOS is the group 
of repressed genes around the est 246D2T7 and est 246B12T7 (both derived from genes 
encoding a senescence associated proteins). A third est derived from this gene family, est 
212B17T7 is also strongly suppressed. This reduction in expression is not found in the other 
cim mutants, but is seen in the BTH treated tissue. As an example of a gene strongly induced 
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in c/w205, est 203C22T7 (BLAST similarity: extensin) can be cited. In mutant cim205, as 
many genes are suppressed as induced. 

This is not the case in mutant cimlYh. Most of the 153 selected genes with altered 
expression (cut-off 2.5-fold) in this mutant are upregulated and only a few are downregulated. 
It is evident that many more elements display significant alterations in gene expression in 
cimlXl compared to cww205. Although the spectrum of gene inductions in the other two 
mutants looks very similar to the spectrum of c/m713, those gene inductions are usually 
weaker and not necessarily significant. Several elements with the most dramatic changes in 
expression are annotated and the results of BLAST similarity searches are indicated in table 
13. They include the known SAR genes, and genes encoding for cell-wall modifying proteins, 
such as extensins {est 1 18N4T7 and 203C22T7) and xyloglucan endotransglycosylase related 
proteins {est E12G2T7 and 92I21T7), as well as genes whose induction was not expected, 
such as the genes encoding squalene monooxygenase and a cytochrome P450, and also genes 
with unknown protein function {est 156F15T). 

In mutant c//w328, only 47 genes show more than 2.5 fold alterations in expression 
compared to wild-type. 35 genes are found only in the cim205 gene group (40%), and 107 
genes are unique to the cimlXh gene group (70%). In cim32&, only 13 genes were found that 
did not also have altered expression in one of the other mutants. More than half of the genes 
induced in dw328 showed also altered expression in one or both of the other mutants: 1 1 
elements induced in cim32S were also induced genes in mutant cim205, 6 are induced in 
mutant ciml\3 and 17 genes are common to all three gene groups. These 17 elements are 
listed in table 13. 



Table 13: ests that revealed differential gene expression (fold expression over wild- 
type) in the cim mutants or following BTH treatment (2 days after 0.3 mM BTH treatment) 



est 


cim20S 




ciiw713 


BTH 


Blast similarity 


203C22T7 1 


5.0 


4.2 


9.3 


6.3 


extensin 


212B17T7 


-7.0 


-2.0 


1.0 


-5.8 


senescence associated protein 


246D2T7 


-7.9 


-1.8 


-1.1 


-4.4 


senescence associated protein 


246B12T7 


-7.9 


-1.9 


1.1 


-5.5 


senescence associated protein 


118N4T7 1 


5.7 


4.0 


7.7 


8.4 


no homology/extensin 3 


156F15T7 


1.9 


2.6 


5,9 




no homology 


92I21T7 


■4.2 


-2.0 


-6.1 


-9.4 


xyloglucan endotransglycosylase 












related 


103L19T7 1 


4.7 


3.3 


5.1 


10.1 


cystein protease inhibitor 
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1 1 £T"7 
1 /HSS 101 / 






O.J 




non-spec, LTP 








A Q 


1 1 1 


xyiogiucan enuocransgiycosyiase 
related 


122B2T7 


3.1 


3.2 


3.9 


5.7 


chaperone 60p 


129F12T7 


3.5 


3,5 


5.5 


9.2 


squalene monooxygenase 


PRl (3x) 2 
PR2 (3x) 2 


10.8 


10.O 


9.0 


13.9 


(PRl) 


4.7 


3.4 


6.2 


8.5 


(PR2) 


PR5(3x) 2 


3.6 


4.1 


6.2 


8.8 


(PR3) 


PAL 2 


3.5 


3.4 


5.2 


7.1 


(PAL) 


CHS 2 


3.7 


3.6 


3.6 


4.5 


(CHS) 


105D3T7 


3.5 


3,6 


4.7 


7.9 


cytochrome P450 



1 these genes fall in the class of the 17 genes with expressions altered more than 3 -fold in all three 
mutants 

2 highest value out of three is shown 

3 est 1 18N4T7 is likely to be a chimeric clone 



Interestingly, most of those common genes were already known as marker genes for 
disease resistance (e.g. PR-1), and all of them are also induced by BTH. Two notable 
exceptions are the previously mentioned genes for a cytochrome P450 (unknown substrate 
specificity) and for squalene monooxygenase. Mutant c/m328 has also increased expression of 
extensin genes (est 1 18N4T7, 203C22T7) but only slightly reduced levels of senescence 
associated protein transcript. 

Also, instead of selecting anonymous gene subgroups for comparison, one can also look 
at the expression of genes in specific pathways (Table 14). Expression of genes for proteins in 
the early steps of the Halliwell-Asada pathway had been studied in Northern blot analysis (see 
section 3.E.L). On the DNA microarray, two more genes were added, ascorbate peroxidase 
(APX) and a glutathione peroxidase (GLUPER) gene. Both of them were consistently induced 
about 2-fold in the cim mutants over wild-type (table 10). 

The microarray can thus be a powerful tool in dissecting pathways. In the analysis of 
mutants, gene expression profiles describe more precisely the characteristics than single 
marker genes do. The fingerprints for cim mutants show many similarities (the **S AR 
profile"), but also distinct features that might eventually help to explain phenotypic 
differences of the mutants. 



Table 14: est coverage of genes encoding central enzymes of the phenylpropanoid 
pathway (A) and the Halliwell-Asada cycle (B). ests that are present in the non-redundant set 
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of ests (used in part for the DNA microarray described above) are printed in bold, and the 
relative expression of the corresponding genes in the dm mutants compared to wild-type is 



indicated. 



Gene 


est clones (MSU collection) 


Expression 


( A) Phenylpropanoid pathway 


PALI and PAL2 1 


CDNA, 128N22T7, 187E19T7 


3 - 5-fold induced 


PALS 


82B7T7, 154F7T7,H2C1T7 


No good signal 


cm 


240A11T7, 123D6T7, 126E1T7, 


Not spotted 




118J19T7 




4CL 


G2D3T7, 175B13T7, 116E12T7, 


Not spotted 




69F10T7 




Chalcone synthase 


CDNA, 177N23T7, 187C23T7, 


CDNA: 3 - 4-fold 


34H9T7, 43C1T7, 88D15T7 


induced, est: no 






differential expression 


Chalcone flavanone isomerase 


177A20T7, 240H10T7 


No differential 






expression 


Dihydrqflavonol-4-reductase 


No est 


n.a/ 


Anthocyanidin synthase 


No est 


n.a. 


Jsoflavone reductase 


177A9T7, 122B24T7 


No differential 






expression 


EPSPS 2 


CDNA, 123B10T7,G10B8T7, 


No differential 




62C8T7, 131D24T7 


expression 


(B) Halliwell-Asada cycle 


Glutathione reductase 


G2C3T7 


Not spotted 


Glutathione peroxidase 


CDNA 3 , 118F4T7, 81B2T7, 


2-fold induced 




218D7T7, 229J21T7, 118F6T7, 






250D17T7, 99N6T7 




Dehydroascorbate reductase 


No est 


n.a. 


Monodehydroascorbate 


G7B8T7, 38H7T7, 134D13T7 


No good signal 


reductase 






Ascorbate peroxidase 


CDNA, 172F1T7, 186E1T7, 


1.8-fold induced 




109F7T7, 213G16T7, 90G17T7, 






157B9T7, 251I20T7 




Superoxide dismutase Fe 


E6E5T7 


Not spotted 


Cu/Zn 


CDNA 3 , 156019T7 


2-fold induced 


Mn 


CDNA 3 , 200F22T7, 123N9T7, 


No differential 




105G4T7, 109J19T7, 94E7T7 


expression 



1. The sequences of PALI and PAL2 genes are too similar to identify gene-specific ests. 

2. EPSPS is not part of the phenylpropanoid pathway, but of the shikimate pathway, furnishing the 
phenylpropanoid precursor phenylalanine. 

3. cDNAs were not full-length clones, but only fragments as described in Table 4. 

4. n.a.: not applicable 
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HL Two-dimensional protein gel electrophoresis of cim mutants 
Bacteria possess, on average, four modified polypeptide chains for every three genes, in 
yeast, there are three polypeptides per gene and, in humans, between three and twenty 
different modified polypeptides can be found for each gene. Moreover, there is a poor 
correlation between the abundance of mRNA and their encoded proteins in eukaryotes, 
arguing for the crucial importance of both qualitative and quantitative analysis at the protein 
level (Landegren, 1999). In plant disease defense, post-transcriptional regulation on protein 
level has repeatedly been shown (Conrath et al> 1997; Dietrich et al 9 1990). Two- 
dimensional (2D) polyacrylamide gel electrophoresis can resolve between 2500 and 10,000 
proteins but the low loading capacity, unprecise pH gradients and difficult identification 
(either by comparison or by microsequencing) limit the actual use of this technology 
(Pennington et al, 1997). 2D gel electrophoresis has been used before to characterize 
Arabidopsis mutants (Santoni et al, 1994). We attempted to compare results obtained with 
the DNA microarray to changes in protein patterns. Total protein extracts of cz7w328, czw713, 
cim611 y NahG, c/w713xNahG, and BTH-treated wild-type plants were separated on 2D gel 
electrophoresis. The insufficient separation of proteins made it difficult to distinguish proteins 
in cim mutants that differ from wild-type. Several additional signals are marked that showed 
up in all three repetitions of the experiments. In total, about 10 signals that differ in each 
experiment compared to wild-type can be identified. In the picture showing the BTH control, 
one might expect to identify PR-1 (pi 8.5, 14.9 kDa). However, none of the signals were 
identified by microsequencing. It can be concluded that for subtle changes in expression, as 
found in cim mutants, 2D gel electrophoresis cannot provide enough sensitivity. 

4. Mapping And Cloning Of cim Mutants 

In order to clone the gene responsible for a Cim phenotype, a positional cloning 
approach was attempted. As a first step, the mutants had to be crossed to a genetically diverse 
ecotype and a linkage disequilibrium between the Cim phenotype and the genotype in a 
segregating F2 population had to be identified. For increased resolution, the number of F2 
plants that must be scored increases. For many cim mutants, the expression of the Cim 
phenotype was unstable in mixed ecotype backgrounds. For four cim mutants, a rough map- 
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position (table 6) was obtained by screening through large F2 populations in order to isolate 
with confidence a sufficient number of F2 siblings with clear phenotype. Segregation in F2 
progeny from the outcross of mutant cimlYh most closely resembled aMendelian segregation 
ratio; therefore this mutant was chosen for further cloning efforts. 

A. Phenotyping of mutant ciml 13 

There were a number of problems involved in the phenotyping of ciml 13. First, there is 
no visible, morphological phenotype associated with the PR-1 overexpression phenotype. As 
an alternative, the increased disease resistance of ciml 13 was a potentially useful trait for 
identifying the Cim phenotype in the segregating F2 mapping population. However, there are 
no known pathogen isolates described that grow equally well on the parental A. thaliana 
ecotypes Col-0 zxidLer (these ecotypes were chosen because of the high rate of 
polymorphisms between the two genomes, estimated at 1 nucleotide polymorphism every 250 
bp; Konieczny and Ausubel, 1993). An obvious trait to score in the F2 is the increased 
activity of Pi?-7/luciferase ? the basis for the initial screen. There are several problems 
associated with this approach as well: 

* The crossing partner A. thaliana ecotype her does not contain the Pi?-7/luciferase 
reporter gene, therefore one quarter of the F2 will not be usable for phenotyping. 
Plants containing the PTJ-i/luciferase transgene can be identified by selection on 
kanamycin or by PCR screening for the T-DNA. Although growth on GM plates 
under selection does not induce the PR-1 gene expression, the phenotype changes 
slightly and phenotyping was never solely based on results obtained from those 
experiments. Therefore, a PCR for the luciferase gene was also established and run on 
all F2 plants and on at least 6 F3 progeny, to allow plants homozygous or heterozygous 
for the reporter gene to be distinguished. 

* PR-1 can be induced by various stresses. Although the levels of induction are 
usually small (with the exception of cell death-inducing events), the high sensitivity of 
the P7?-7/luciferase system might make it difficult in some cases to distinguish 
between stress induction and genetic induction. 

These problems can be partly circumvented by using endogenous biochemical markers, such 
as PR-1 gene induction, or accumulation of salicylic acid, or the accumulation of PR-1 
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protein. These techniques were tried to find the most reliable technique to determine the 
phenotype of F2 plants. Because each F2 plant has to be confirmed in the next generation on 
population level (16 F3 plants are required to distinguish between heterozygosity and 
homozygosity of a dominant trait in F2 plants with 99% confidentiality; Koomeef and Stam, 
1 992) the technique must not be labor-intensive. Assuming proper segregation, the required 
number of F2 plants to map a gene in the Arabidopsis genome to a 50 kb region, can be 
derived from the corresponding average resolution of 0.25 cM. With the size of the 
Arabidopsis genome round off to 100,000 kb and 500 cM, on average, 5 cross-over (c.o.) per 
haplotype per meiosis occur. The required population size to find 1 cross-over per 50 kb 
results with 100.000 kb/50 kb = 2000 cross-overs in the population to 2000 c.o. / 5 c.o. per 
haplotype = 400 haplotypes = 200 plants in the F2 population. For the probability to find one 
recombinant in this interval to be no less than P ~ 95%, with E = 200 
n = ln(l-P)/ln(l-l/E) 

= ln(l-0.95)/ln(M/200) 

= 600 F2 plants 

600 F2 plants would have to be screened and confirmed in the F3 on population level. 

This number excludes the determination of SA accumulation by organic extractions. 
Western blot analysis (or ELISA) for the PR.-1 protein was tried with the strongest mutant 
cimSlO. The detection is less sensitive than for the PR-I mRNA. Furthermore, it also requires 
an effort per sample too big for the vast sample number required. 

The detection of the PR-1 marker gene expression by dot blot (RNA) analysis is feasible 
in the F2, and was used to confirm in random samples the phenotype determination by 
luciferase activity but does not help to improve the reliability of the phenotype determination. 
The rate of phenotype miscalls was the same as when F2 plants were scored for luciferase 
activity. The Cim phenotype is only expressed in a small percentage of the F2 plants, the 
penetrance is incomplete (15% vs. 75% expected). 

Hence, only the luciferase in vivo assay would allow the screening of 10,000 F3 plants 
in a reasonable amount of time. Because of the penetrance problem encountered in the Ler 
ecotype, about 5000 F2 plants were screened for constitutive luciferase expression. 855 F2 
plants, for which the phenotype could be called without doubt, were used for mapping 
purposes. Mostly, they were homozygous for the cim mutation. For the 855 F2 plants, the 
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presence of the luciferase gene was confirmed by PCR. The F3 populations were scored on 
both kanamycin selection and without selection. 

To avoid the severe penetrance problem of the mutation in the Ler ecotype, mapping 
crosses to the A. thaliana ecotype Wassilewskija (Ws-ff) were also tried, but the penetrance did 
not improve. 

B. Genetic mapping of ciml\7> 

Several genetic markers have been identified, including SSLP and CAPS markers, that 
are polymorphic between the A. thaliana ecotypes Col-Q and Ler. To get a rough map- 
position, about 30 markers were used on a F2 population of ciml\Z of 65 individuals. 
Linkage to the phenotype was found with the SSLP marker nga280 (at 81.4 cM on the Lister 
and Dean RI map, 2 recombinants in 124 meiosis). The next closest SSLP markers, nga248 
(at 40.0 cM) and ngal 11 (at 1 1 1 .4 cM) showed less linkage. In this genomic region, no PCR- 
based genetic marker was available. RFLP markers were present in the interval, but these are 
difficult to use on individual F2 Arabidopsis plants because of the amount of genomic DNA 
required. To convert RFLP markers into PCR-based markers, several RFLP probe clones 
were sequenced and sequence-specific primers were designed. PCR fragments from both 
parental ecotypes were digested using 48 to 80 different restriction endonucleases to find a 
cleaved amplified polymorphic sequence (CAPS). Polymorphisms were detected as 
differences of fragment sizes after electrophoretic separation. This procedure was successful 
for the RFLP markers mi209, mi304 and mi291a, and for the gene N1A2. No polymorphism 
was detected for RFLP marker mil 06. When the F2 population was scored at the marker 
mi291a, a tight linkage to the phenotype (5 recombinants in 120 meiosis) was detected. Using 
the Kosambi function (map distance D in cM; D = 25 In (1 00 + 2r)/(100 - 2r), where r is the 
recombination frequency expressed as a percentage (Koorneef and Stam, 1992; Kosambi, 
1944); for recombination frequencies r smaller than 4%, the difference between map distances 
D and r becomes less than 1% and r can be used without logarithmic corrections) to convert 
recombination frequencies into map-distances, the distance between the two genetic markers 
mi291a and nga280 was calculated at 9.5 cM (r = 5.7), which corresponds well to the 
published value (1 1 cM). PAP240 was identified as an expressed sequence tag hybridizing to 
a YAC clone in this region. The clone (obtained from M. Raynal, INRA, Perpignan, France) 
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was converted into a CAPS marker in similar manner to the RFLP markers. The marker 
PAP240 divided the genetic distance between the markers mi291a and mi209, and narrowed 
the interval containing the cim mutation to roughly 2 cM. 

Sequencing the clones of the two RFLP markers APK100 and LOX in this interval did 
not detect any polymorphism between Ler and Col-0 DNA. The clones were therefore only 
used as RFLP probes on DNA blots of pooled F3 progeny from individual F2 plants showing 
recombination between PAP240 and mi209. The two markers delimited an interval of 6 and 4 
recombinants from the ciml\Z mutation. This corresponds to a map-distance of 6 / (2 x 855) 
- 0.35 cM on the left side (APK100) and 0.23 cM on the right side (LOX). Ideally, an 
interval of 0.58 cM corresponds to a physical distance of 100 kb in Arabidopsis, and can be 
spanned by one or two BAC clones. 

C. Physical mapping 

The Arabidopsis genome is well represented in several large-capacity vectors. Three 
YAC and two BAC libraries exist that are partly assembled into contigs. From the physical 
map, sequence information can be derived to design new genetic markers and eventually to 
construct a high-resolution genetic map. Sequence information can be obtained from publicly 
available BAC end sequences (http://genome.bio.upenn.edu), YAC end rescue, cloning of 
BAC (random or end-) fragments, or by the identification of ests that hybridize to BAC 
clones. With the rapid progress of the Arabidopsis sequencing project, the chance of finding 
sequence information of entire BAC clones is also increasing. 

The two closest flanking genetic markers, APK100 and LOX were used as anchor points 
on the physical map. They were hybridized to BAC filters containing subsets of the two 
available BAC clone libraries, IGF and TAMU (ABRC stock center). APK1 00 hybridized to 
BAC F16J8, F15I10, F22G10 and F8H4. The LOX probe hybridized to BAC T7N22, T3A10, 
and to the BAC clones F19C6, F9I9, F26H12 and F5P9. From these starting points, a 
physical contig was constructed, using both experimental and non-experimental data. J. Ecker 
(Univ. Pennsylvania, PA), and T. Altmann's laboratory (Max-Planck-Institut fur molekulare 
Pflanzenphysiologie, Golm, Germany) provide hybridization data of BAC clones to BAC end 
fragments and construct BAC contigs (http://www.mpimp- 

golm.mpg.de/101/mpi_mp_map/bac.html). The considered region was however not yet 
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contiged. Washington University (St, Louis, MO) provides Hindm fingerprints of BAC 
clones (http://genome.wustl.edu/gsc/cgi-bin/arab/atdatabase.shtml). Based on similarity in 
restriction fragment patterns, different BAC clones can be aligned with certain probabilities. 
This allows a virtual walk on the chromosome and enabled us to integrate the two genetic 
markers into one physical contig. The contig was confirmed by PCR of BAC end fragments 
on overlapping BAC clones. The minimal pass between the two markers requires most likely 
three BACs (F2H24, T28H10, and F14C21, however, the overlap between T28H10 and 
F14C21 is uncertain. The physical distance between the two genetic markers is therefore 
unlikely to exceed 350 kb (average size of a BAC clone is 100 kb). 

D. Genetic marker development and fine-mapping 

The sequence of one of the BAC clones in the contig, BAC F20D21, became partly 
available (Genbank accession number AC005287; preliminary release from October 3 rd 1998), 
The first attempts to develop CAPS markers from PCR fragments were unsuccessful, 
revealing the presence of a non-polymorphic genomic region. Therefore, the fragments were 
used as probes in RFLP survey blots. Probe 20D21-13 detected a Dral restriction 
polymorphism between the two parental lines CoUO and Ler. The 10 recombinants between 
the two flanking markers were scored at this locus and a cosegregation of this marker with the 
mutation ciml\3 was found (table 15). Based on the theoretical resolution obtainable with 
1710 meiosis that were used to build the linkage map and assuming an even disribution of 
recombination, the mutation should lie within 23 kb of any cosegregating marker. 

Because the development of new markers in this region proved to be extremely difficult 
by the restriction digest approach, alternative methods were also tried. Simple sequence 
repeats, like polydT, or polydCA, were identified on the BAC F20D21 and flanking primers 
were designed to identify length polymorphisms (SSLP). Usually, these repeats are 
meiotically unstable and vary in size between evolutionarily distinct ecotypes. 

Six of these short fragments were amplified from both ecotypes and separated on high 
resolution gels, but no length polymorphisms were detected. 

As a last, most accurate (and costly) solution, systematic sequencing of genomic DNA 
from both parents corresponding to the insert cloned in BAC F20D21 was conducted. PCR 
primers were designed every 1 kb, the slightly overlapping fragments purified and sequenced 
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with the same PCR primer pair. Of the 100 kb of BAC F20D21 that were sequenced, only 7 
single nucleotide polymorphisms between the ecotypes Col-0 and Ler were detected, 5 of 
them were restriction fragment polymorphisms and 4 were converted into CAPS markers 
(orf5 5 20D21-2, orf52, cf2-12). The fifth polymoiphism was already used as the RFLP marker 
20D21-13. This is an extremely low rate of genetic polymorphism and explains the 
difficulties encountered in marker development by random trials. 

Interestingly, all these markers cosegregated with the ciml\1 phenotype, thus extending 
the interval with no recombination event in the mapping population to at least 140 kb, which 
is more than 5 times larger than we estimated for our F2 population size based on genome- 
average recombination. 

In order to mark the limits of the interval, markers that reveal recombinations between 
the mutations and the nearest defined genetic marker loci were needed. The BAC clone 
T22H22, which overlaps with the right (SP6) end of the BAC F20D21 was, similar to 
F20D21 , partly sequenced by the Arabidopsis genome initiative in the course of this work 
(AC005388). PCR fragments on BAC T22H22 were generated of both parental lines and 
sequenced. The locus T26 (at 26 kb from the BAC end F20D21) was polymorphic and the 
recombinant F2 plant number 1006 had a recombination event between this marker and the 
mutation, thus limiting the physical interval on the right side (table 1 5). 

To limit the interval on the left side of BAC F20D21, all available BAC end sequences 
between APK100 and F20D21-2 were used in Southern blot analysis to find restriction length 
polymorphisms (the BAC end sequences were too short to develop CAPS markers), but no 
such polymorphism was detected. Therefore other mismatch detection methods, used 
primarily in mammalian mapping projects and routine identification of known mutations, 
were tried. 

Heteroduplex analysis detects changes in confirmation of DNA duplexes caused by 
single base pair mismatches. PCR fragments in the corresponding genomic region, are 
denatured, mixed and hybridized with PCR products of the second parental ecotype. Any 
point mutation results in the formation of two heteroduplexes as well as two homoduplexes. 
The heteroduplexes have altered confirmations, which can be detected by the altered 
migration in a polyacrylamide gel. Hauser et ah found in Arabidopsis 50% of 36 loci (230 bp 
to 1000 bp PCR fragments) to be polymorphic (Hauser et ah, 1998). Although we reproduced 
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the published polymorphisms, we did not identify new markers by this method using short 
PCR fragments that were derived from the available BAC end sequences in the interval of 
interest. Similarly, we did not detect poolymorphisms with a second method, the RNase 
cleavage assay that is supposed to detect 90% of all mismatches (Ambion, Austin, TX). 
Specific PCR primers for 5 BAC end fragments between the markers APK100 and 20D21-2 
with T7 and SP6 promoter sequences were used for thermocycle amplification, followed by an 
in vitro transcription and a cross hybridization of the transcripts of the two ecotypes. 
Different endoRNases were used in vain to cleave potential mismatches. Therefore, as before, 
sequencing seemed to be the last solution to identify the rare nucleotide polymorphisms in this 
genomic region between the two ecotypes Col-0 and Ler. Using long (26-mer) PCR primers 
from several BAC end sequences in the direction of BAC F20D21, a long range PCR 
fragment between BAC end F17M20 and the left (T7) end of BAC F20D21 was amplified and 
cloned into the vector pCR2.1. These additional 9.1 kb were sequenced by random 
transposon integration. Marker development based on this sequence is in progress that will 
hopefully limit the cosegregating interval on the left side of BAC F20D21. 

The bad genetic resolution left us with a region of around 150 kb, in which the cim gene 
should be localized. In order to identify possible candidate genes, open reading frames (ORF) 
on BAC F20D21 and T22H22 were identified using consensus prediction programs (Genscan, 
http://ccr.081.mit.edu/Genscan.html). BAC F20D21 contained 28 putative genes. For some, 
the prediction was confirmed by the presence of ests in the databank or by Northern blot 
analysis. ORFs were sequenced in the Col-0 wild-type and ciml\3 mutant. In cases where 
the sequenced fragments did not overlap with the next fragment, Northern blot analysis was 
performed to detect potential point mutations in promoter elements, leading to changes in 
gene expression. One gene with homology to a TMV resistance protein showed a 5-fold 
induction of expression in the mutant cz/w713. The expression of a serine/threonine protein 
kinase in this interval was 3-fold reduced in the mutant. 
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Table 15: Recombinant F2 plants between the genetic markers PAP240 and nga280 on 



chromosome 1 in the mapping population of ciml\3. 
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1 recombinant 382 did not set seeds in the F2, therefore no data on RFLP markers were obtained. 

2 n.d.: not determined 



Other genes did not show alterations in expression. In the two genes with altered expression, 
no mutations on nucleotide level were detected in the coding region or in the promoter The 
changes might therefore be a consequence rather than the cause of the mutant phenotype. 
Although almost the entire cosegregating region on BAC F20D21 and T22H22 was 
sequenced, no (point) mutation was identified in mutant czw713. 
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Two additional mRNA species were detected using the long range PCR fragment on 
BAC F17M20 as a probe in Northern blot analysis, and neither was differentially expressed in 
the mutant. The presence of two genes in this interval was also predicted by the Genscan 
program after the sequence had been obtained. One of the genes was identified as a homolog 
of the mammalian ataxin gene. The long range PCR fragment has not yet been sequenced for 
the mutant c/m713. 

E. Cloning of flanking DNA in cim mutants generated by T-DNA tagging 
A second screen for cim mutantswas performed in a population of mutants generated by 
T-DNA tagging and seven additional cim mutants were identified (table 16). Segregation 
analysis for Basta resistance in segregating T2 populations revealed that for most of the 
mutants, several T-DNAs were integrated into the genome. For lines 27001, 28492, and 
33673, no progeny plant died in populations comprising at least 50 individuals in the selection 
for Basta resistance, which suggests that at least three T-DNA inserts may be segregating 
independently. 

The number of inserts was also estimated by probing genomic Southern blots from each 
of the mutants with T-DNA probes (using pBluescript and the RB), This method usually gave 
a higher estimate of T-DNA inserts than segregation analysis. There are two possible reasons 
for this discrepancy. Multiple T-DNAs may have inserted at a single genetic locus, or partial 
T-DNAs, lacking the BAR gene but containing pBluescript and RB segments may have been 
inserted. 
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Table 16: Genetic analysis of cim mutants generated by T-DNA insertion. Out of 
10,000 primary transformants, 7 lines were retained that showed reproducible Pi?-7/luciferase 
activity in the next generations. Those were submitted to segregation analysis on selective 
media (Basta) and Southern blot analysis to identify T-DNAs that cosegregate with the 



phenotype. Plasmid rescue, or TAIL PCR was performed to clone flanking genomic DNA. 
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1. Luciferase in vivo activity was compared to mutant cim713 (set to ++)* Stronger activity was rated 

+-H\ 

2. No plant died (out of more than 50 T2 plants) 

3. n.t.: not tried 



In many cases, no specific T-DNA was correlated to the expression of the Cim 
phenotype. Because the mutations are recessive or codominant, the plants without (or with a 
weak) Cim phenotype might be heterozygous at the cim locus, thus giving a signal in the 
Southern blot analysis that does not match a visible phenotype. Alternatively, T-DNA- 
induced, but not linked, genomic rearrangements might cause the observed phenotype. 

For line 27001, plasmid rescue was tried for the RB flanking sequence with an EcoRI, 
or HindM digest. The left border can not be rescued because of the lack of unique restriction 
sites. 

Several cloned flanking genomic fragments were sequenced and the inserts compared by 
BLAST similarity search to sequences in the Arabidopsis genome or elsewhere. Two flanking 
sequences were derived from sequenced parts of the A. thaliana genome, two had homology 
to Arabidopsis ests and two had no significant homology. One plasmid contained only the T- 
DNA cloning vector, probably integrated into the genome by inefficient cleavage of the T- 
DNA at the border sequence during the transformation. 

For five lines, a TAIL PCR was performed to get LB- and RB-flanking genomic DNA. 
For all lines, TAIL PCR was successful and several fragments per line were either subcloned 
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or directly submitted for sequencing. Some fragments contained only the T-DNA sequence, 
possibly because of tail-to-head or head-to-head cointegrations of several T-DNAs into the 
genome. In several cases, known genes or sequenced genomic regions were cloned. The 
fragments were used in Southern blot analysis to confirm the insertions and to identify the 
gene knock-out that cosegregated with the phenotype. These genes will be transformed into 
the mutants to establish wild-type phenotypes and to genetically prove the cloning of cim 
genes. 

5. Epistasis studies with cim mutants 

Several other mutants involved in disease resistance became available during the course 
of this study* Most of them were loss-of-function mutations and for understanding the genetic 
signaling pathway it is important to establish the epistatic relationship between these mutants 
and the cim mutants. However, several problems occur: 

• The Cim phenotype is not expressed in 100% of the progeny, making genetic 
analysis in F2 populations difficult, especially if no genetic marker linked to the 
mutation can be followed. 

• Several loss-of- function mutants are not in the same ecotype. For instance the edsl 
mutant exists in the A. thaliana ecotype Ler and Ws-0> This renders analysis of 
resistance more difficult because of the presence of race-specific R genes (no 
Peronospora parasitica isolate has been identified that infects both Col-0 and Ler). 
In addition, crosses to other ecotypes increase the penetrance problem, as described 
previously. 

Despite these problems, crosses of several cim mutants to the disease resistant 
compromised mutants niml~4 t pad4 9 edsl and ndrJ, as well as to the jasmonic acid signaling- 
deficient mutant jarl and the ethylene-insensitive etrl mutant have been obtained. They can 
be analyzed after a cim gene has been cloned so that the genotype can be confirmed in the F2. 
Crosses to the SA-degrading NahG line could be analyzed in the Fl, because both, the NahG 
phenotype and the Cim phenotype are inherited in a (co~) dominant manner. 
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A. Analysis of cim mutants in NahG backgrounds 

All cim mutants displayed increases in SA content, with the exception of the "weakest" 
mutant dw658. To address whether SA accumulation is required for the primary Cim 
phenotype (i.e. PR gene expression), all 16 cims were crossed to the SA-degrading NahG line 
and monitored for Pi{-//luciferase gene expression in the Fl by in vivo photon emission. Fl 
plants were confirmed for double marker antibiotic resistance by plating on selective media 
(kanamycin for presence of the luciferase marker gene, and hygromycin for the presence of the 
NahG gene) and by RNA blot analysis for the hygromycin gene. PiJ-Muciferase gene 
expression was suppressed in all crosses (at least three crosses per mutant with 10 to 20 plants 
per cross), revealing the dependence of PR-1 gene expression on prior SA accumulation. 

Resistance to P. parasitica Noco2 was, surprisingly, not completely abolished in at least 
two mutants (cim695 and cim7l3) in combination with NahG (mutants cim&, c/w658, and 
cim677 were not tested). It is unlikely that this resistance is due to residual SA because PR-1 
gene expression was suppressed and because other mutants that accumulated 2-fold more SA 
than mutant cim713 nevertheless lost the Nocol resistance phenotype in a NahG background 
(e.g. mutant c/w810). Gene expression profiles for Fl plants before and after pathogen 
treatment showed that constitutive transcription of the endogenous PR-1 gene is indeed 
suppressed. Similarly, the NIM1/NPR1 gene, which is usually activated two- to threefold 
during an SAR response, was not induced in presence of NahG, These results demonstrate 
that S A accumulation is not a general requirement for Peronospora resistance in at least two 
cim mutants. Interestingly, we did not observe a correlation of S A-independent resistance and 
PDFL2 gene expression, as previously suggested (Bowling et al 9 1997). Mutant cim2Q5 
expressed PDF1.2 to higher levels than wild-type, cim824 did not. Both mutants are 
susceptible to P, parasitica in a NahG cross. Mutant cim695 does not constitutively express 
the PDFL2 gene, yet PDF 12 transcription was induced when this mutant was crossed to a 
NahG line. In contrast, mutant cimlYh expressed elevated PDF 1.2 mRNA levels that were 
decreased in a NahG background, but resistance to P. parasitica was retained in both mutants. 
Thus, transcription of neither SAR genes nor PDF 7.2 (as a putative marker for 
ethylene/jasmonate mediated inducible resistance) is increased in cim7\3 when crossed to 
NahG, revealing a novel, unrelated mechanism for resistance to P. parasitica. 
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B. Gene profiling for cim mutants in NahG backgrounds 

In order to correlate the SA-independent disease resistance mechanism of mutant 
c//k713 to marker genes, and to monitor gene expression changes of cim mutants in a NahG 
background, three DNA microarray experiments were conducted (experiments 5, 6, and 7 in 
table 9). Using the results of these experiments, the gene expression of the mutants cim205 
and cimlYi in a NahG background can be compared to the expression profile of both the 
NahG plants, and to profiles of the mutants themselves (using information of the experiments 
1 and 3 in table 9), 280 elements that showed altered gene expression (2.5 fold over wild- 
type) in at least one of the experiments were chosen for comparison. Compared to previous 
profiles, the strong differences in gene expression are remarkable. Several genes were strongly 
upregulated in NahG (up to 40-fold), others show up to 100-fold reductions in gene 
expression in NahG compared to wild-type. In general, these genes are also altered in the cim 
mutants in NahG backgrounds (although NahG is not a mutant sensu stricto, the crosses of 
cim mutants to the NahG line are in the following chapter for the ease of wording referred to 
as "double mutants".) Interestingly, most of those genes have no homology to entries in 
public databases (table 1 7). As with the cim mutants (table 1 1), data obtained by Northern 
blot analysis correspond well with microarray data, which are usually somewhat higher (table 
17). The overall pattern of gene induction in NahG compared to the double mutants revealed 
that NahG imposes many changes in gene expression such that the three profiles are all very 
similar. Again, as with the cim mutants alone, a common subset of genes with similar 
expression patterns in the double mutants and NahG (the "NahG profile") can be identified, 
including the elements with the most extreme changes, est 90G10T7 (+41.4), 140E3T7 
(+23.1), 198G3T7 (-26.3), 127P22T7 (-67.3) and 103C7T7 (-21 .5). There are nevertheless 
differences between the profiles. For instance, cim205 x NahG expresses more than 30 genes 
5- to 10-fold stronger than wild-type that are not differentially expressed in NahG or c/w713 x 
NahG. About 20 genes that are reduced between 5- to 10-fold in NahG are either not as 
strongly repressed in the double mutants or are not repressed at all. Thus, there are 
differences between the two double mutants and NahG. 
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Table 17: ests that revealed differential gene expression in cim205 x NahG and cim713 
x NahG on the DNA microarray ('*chip") 5 and the values obtained in Northern blot analysis 



("blot'*). 



esf/clone 


homology 


BTH 


NahG 


cim205xNahG 


citn713*NahG 






blot 


chip 


blot 


chip 


blot 


chip 


blot 


chip 


198G3T7 


none 


11.3 


5.6 


-8.8 


-26.3 


-16.5 


^0.6 


-19.5 


-40.1 


103C7T7 


unknown 


3.7 


2.3 


-10.7 


-21.5 


-6.0 


-13.8 


-12.1 


-28.8 


CXC750 1 




5.5 


3.4 


-1.7 


-10.5 


-9.1 


-65.3 


-10.6 


-19.1 


93C1T7 


unknown 


5.1 


3.0 


4.5 


-97.2 


-6.2 


-22.0 


-8.6 


-13.4 


124K13T7 


CXC750 


5.1 


3.4 


-1.7 


-65.3 


-7.6 


-10.5 


-12.5 


-19.1 


127P22T7 2 


unknown 


2.3 


5.1 


-1.2 


-67.5 


-1,3 


-66.3 


-1.3 


-30.0 


140E3T7 


unknown 


1.6 


-1.2 


1.4 


23.1 


5.4 


17.9 


28.8 


23.6 


119D12T7 


none 


4.3 


-1.6 


2.1 


21.7 


4.1 


9.5 


3.6 


17.8 


119C13T7 


none 


1.7 




1.3 


9.9 


3.6 


10.8 


2.1 


12.8 


90G10T7 


none 


1.8 


1,1 


1.3 


41.4 


4.3 


29.5 


2.4 


38.0 


103L19T7 


unknown 


15.4 


10.1 


-2.0 


-3.9 


-2.9 


-4.7 


-2.5 


-2.8 


PR2 




6.5 


n.t 3 


-6.1 


n.t. 


-13.5 


n.t. 


-8.3 


n.t. 


PR5 




7.9 


n.t. 


-4.1 


at. 


-10.5 


n.t. 


-4.7 


n.t. 


PAL 




2.4 


n.t. 


-3.7 


n.t. 


-9.6 


n.t. 


-5.6 


n.t. 


172A5T7 


none 


5.9 


n.t. 


-9.0 


n.t. 


-10.4 


n.t. 


-13.2 


n.t. 


163B24T7 


none 


4.6 


n.t. 


-16,8 


n.t. 


-13.9 


n.t 


-26.8 


n.t. 


105D3T7 


unknown 


7.9 


n.t. 


-8,6 


n.t. 


-3,0 


n.t. 


-11.6 


n.t 


178N5T7 


none 


4.3 


n.t. 


-6.7 


n.t. 


-107 


n.t. 


-8.7 


n.t. 


177C23T7 


none 


5.8 


n.t. 


-7.2 


n.t. 


-5.2 


n.t. 


-10,1 


n.t. 



1. not the same ests were used in Northern blot analysis and DNA microarray 

2. Northern blot signal very weak 

3. n.t.: not tested 

The second question addresses the similarity between gene expression in the mutants 
compared to the mutants in NahG backgrounds. It is already clear that fundamental changes in 
gene expression are introduced with the NahG gene. Many genes that are induced in cim 
mutants (including PR-2 and PR-5) were repressed by the presence of NahG. One of the 
strongest changes in expression occurred for the gene CXC750 that is pathogen-inducible 
(Aufsatz and Grimm, 1994) and whose expression is obviously SA-dependent (table 17), The 

compared to wild-type because its basal levels in wild-type are already low. Others (most 
prominently est 140E3T7, 198G3T7, 103C7T7) did not change the expression in cim mutants 
but are unique to the NahG line and the double mutants. And a few genes retained the same 
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expression pattern in both the cim mutant and the double mutants (for example some elements 
around the est 93C1T7). Of the 200 genes that are altered more than 2.5-fold in the cross 
cim205 x NahG, 56 are also altered more than 2.5 fold in cim205 alone, but in only one case is 
gene expression changed in the same direction. All the other genes were altered inversely. 
This complementary effect on gene expression of the cim mutants and the NahG lines might 
be directly compared by eliminating the intrinsic base of comparison (wild-type). Subtracting 
the expression values of one experiment from another (A[cim x NahG; wt] - A[cim; wt] = 
A[cim x NahG; cim]) shows that only very few differential expression values are close to zero, 
i. e. most of the expression values of cims and NahG lines are different. Clearly, the changes 
in gene expression imposed by NahG are radical and do not only concern genes required for 
catechol degradation. Salicylic acid might have a major importance for the regulation of 
many genes. Again, in the 2D protein gel electrophoresis these changes are not well reflected. 

6. Using Expression Profiling To Study The Transcriptome of Arabidopsis During S AR 

To gain a more thorough understanding of the complex regulation of gene expression 
during SAR, and to identify new (marker) genes associated with SAR, we used expression 
profiling (Schena etal y 1995; Shalone/a/., 1996). PolyA+ RNAs obtained from plants 
grown under 18 different conditions that either induce or repress SAR (see Table 18 below) 
were used in mixed hybridizations against RNA of untreated wild-type plants on a DNA 
microarray with 10,000 ESTs, a Unigene set representing roughly one third of all Arabidopsis 
genes. 

Hybridizations were done by Synteni, Inc., Fremont, CA as described by Ruan, et al, 
1998. For spiking controls, refer to http://www.synteni.com/client/controls/index.html. We 
included several quality controls to check the accuracy of the DNA microarray data. Spot-to- 
spot variability was estimated using 30 cDNAs spotted three times each. Measured variability 
was usually smaller than 20%. The inverse labeling of sample cimll not only showed the 
reproducibility of independent experiments ( linear regression coefficient r - 0.83), but also 
the symmetry of labeling. We also compared more than 40 DNA microarray data points to 
Northern blot results and found a linear correlation of r — 0.83. 
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The EST "Unigene" set was obtained from the Arabidopsis Biological Resource Center, 
(Ohio State University, Columbus, OH). 10,000 of the 14,000 clones were amplified by PCR 
using the M13 universal reverse and -21 forward primer, with modified 5' amino end for 
spotting onto a glass slide* A fraction of the ESTs, including all mentioned in this manuscript, 
were resequenced at our facility. Roughly 30 % of the EST sequences did not correspond to 
the original sequence as represented in the AATDB. Based on our own BLAST search and on 
estimations made by others (Delseny et aL t 1997), we extrapolated the redundancy of the 
"Unigene" set to 1.5 to 2-fold. Hence, while the majority of the checked genes match only 
one EST in our set, many match at least two ESTs. One explicit example can be found in the 
PR1 regulon: Disregarding the three CDNA clones PR1, PrxC, and PR5 that were added 
purposely several times on the microarray, 19 ESTs represent 14 different genes. 

In addition, two treatments which alter plant metabolism, but are not related to SAR, 
were analyzed (Table 18). Some samples were taken during the induction phase of SAR (4 
hours after BTH induction), or from primary, pathogen-infected tissue, but most of the 
samples were derived from (steady state) SAR maintenance phase (e.g. 48 hours after BTH 
induction, cim mutants). Together, the experiments gave us 1.8 x 10 5 gene expression data 
points. Under these 18 conditions, however, only 660 different genes displayed significant 
differential expression compared to wild-type in at least two SAR-relevant samples 
(significance level defined as 2.5 fold differences from wild-type). By comparing the 
fluorescence signal intensities to spiked controls, abundance of mRNA species can be 
determined . Most of the genes with altered expression fall into the class of low-abundance 
transcripts (1 to 10 copies per cell), while a few were in the medium abundance class. 
Housekeeping genes, with more than 100 copies per cell, did not exhibit altered gene 
expression under SAR. 
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Table 18: Diversity of conditions used to describe the transcriptome of Arabidopsis 



thaliana dining SAR. 



Sample (all compared to wild-type tissue) 


Comments 


Adenylosuccinate synthetase antisense 
Hydantocidin (0.9 mM) 


Reference inductions, irrelevant to resistance 


cimO 
dm? 
cimll 


Constitutive SAR mutants 


NIM1 overexpresser 


Primed SAR response 


NahG 

cim6NahG 

cimll^ahG 

NahG + Pst DC3000 avrrpml T tissue, 44 hours 


No SA accumulation, no SAR response 


NahG + Pst DC3000 avrrpml V tissue, 44 hours 
BTH 4 hours 
BTH 48 hours 


Primary (partial) LAR induction 
SAR induction 
SAR maintenance 


P.p. EMWA1, 48 hours 

Pst DC3000 avrrpml V tissue, 44 hours 

Pst DC3000 avrrpml T tissue, 44 hours 

nim + Pst DC3000 avrrpml T tissue, 44 hours 

P.p. NocoS, 48 hours 


Compatible (disease causing) interaction 
Incompatible interaction 



First, we compared gene induction patterns of these 660 genes among each of the 
different conditions. Data representing a series of differential gene expression measurements 
was obtained. Analysis of both the regulation of individual genes under varying conditions or 
of the overall similarity of the various conditions based on expression profiles required the 
calculation of distance matrices. These were calculated in S-Plus using the dist function under 
the euclidean metric. Clustering of similarly regulated genes was performed and visualized in 
S-Plus using hierarchical clustering under the compact method with the, functions hclust and 
plclust Trees used to depict similarity in gene expression patterns among the conditions may 
be produced using the fitch and drawtree programs distributed in the Phylip suite (Phylogeny 
Inference Package) version 3.57c. 

Out of 10,000 ESTS, 661 that displayed differential gene expression under two or more 
of the 16 SAR-inducing or -repressing conditions were used to calculate similarities in gene 
expression profiles. To indicate a scale of similarity between two different conditions, we 
introduced a unit of similarity, which corresponds to the summed difference of relative gene 
inductions under two conditions over all 660 genes, calculated according to the formula: 
Sum g enesi-n(Abs(A(expressioncondi» expressionc 0n d2)))/n with n = number genes (661). This 
simplified scale compares well to the results obtained by the more complex statistical methods 
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and resulted for the pair (cim7, cimll) in an average difference of gene induction of 1.1 units 
per gene, compared to 3.7 units for the condition pair (cimll, NahG). 

Genetic, chemical and biological induction of SAR all caused very similar patterns of 
gene induction. The three cim mutants, which constitutively express SAR, have a pattern of 
transcriptional induction similar to that caused by BTH during SAR maintenance. 
Interestingly, the overexpression of NIM, which primes the SAR response, also resembles 
SAR maintenance (U.S. Patent No. 6,031,153; Cao et aL, 1998; Maleck et al, 1998). 
Compatible and incompatible races of the pathogens Peronospora parasitica and 
Pseudomonas syringae induced many common genes, although the pattern of gene induction 
was not identical to either of the BTH-treated samples* In contrast to the requirement for SA 
accumulation to high levels for SAR induction, many gene inductions are triggered by low 
levels of SA* For example, in NahG-expressing plants that accumulate only low levels of SA, 
the expression of more than 300 genes is significantly altered. This emphasizes a general 
regulatory role for SA in the plant cell. 

NahG suppresses SAR gene expression in crosses to two of the SAR-constitutive cim 
mutants, cim6 and cimll, to a baseline resembling that of NahG-expressing plants. In 
addition, NahG expression results in a characteristic gene expression fingerprint in secondary 
tissue from plants inoculated in primary tissue with avirulent bacteria. This corresponds to the 
inability of these plants to establish SAR. Interestingly, the corresponding primary tissues in 
NahG-expressing plants display changes in gene expression which compares very closely to 
wild-type primary, infected tissue and this sample does not cluster with other NahG samples. 
This is consistent with findings from grafting experiments that SA is required for SAR 
development in systemic tissue, but that the systemic signal can be emitted from primary 
infected NahG-expressing tissue (Delaney et al, 1994; Gaffhey et al, 1993; Vernooij et aL, 
1994). 

We used these expression profiles to identify classes of genes that were consistently 
coregulated, and we sought to identify genes associated with SAR. We used statistical 
phylogenetic cluster analysis to compare the gene expression profiles among 660 genes under 
all 18 conditions. Clusters of genes with similar induction patterns under all conditions were 
grouped. The two clusters most distinct from the PR1 gene induction pattern, with EST 
127P22T7 and EST 163B24T7 as "type genes," contain genes that are strongly (more than 20 
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fold) suppressed by NahG expression. The cluster containing chalcone synthase (ChS) as the 
type gene includes genes induced during SAR-maintenance, but not during S AR-induction; 
these are weakly repressed by NahG expression. The cluster containing EST 209E19T7 
defines genes that are transcriptionally induced in NahG-expressing plants. The cluster 
containing EST 118P18T7 defines genes that are not significantly responsive to SAR- 
inducing conditions like chemical and genetic induction, but do respond to avirulent bacteria 
and are downregulated in NahG expressing plants. Phenylalanine ammonia lyase (PAL) and 
20 other ESTs that cluster together are repressed by NahG expression, but are induced during 
the maintenance phase of S AR, for example in dm mutants or 48 hours after BTH treatment. 
The cluster of "PR1 like" genes exhibits similar induction behavior to genes in, the PAL gene 
cluster but these genes are only weakly suppressed in NahG-expressing plants. 

To address the question of expression induction relatedness among genes within one 
cluster, we analyzed the PR1 cluster in detail. The PR1 regulon contained 25 other ESTs (17 
different genes). These are prime candidates for SAR marker genes and the encoded proteins 
are likely to play a physiological role in SAR. The estimated 1.5 to 2-fold redundancy of our 
EST set is a good internal control for this analysis and we also included three replicates of the 
PR5 and the PerC cDNAs (as well as 28 other relevant cDNAs) on the DNA-microarray* All 
three copies of the two genes cluster with PR1, showing the robustness of the DNA 
microarray analysis. Similarly, two ESTs that are derived from Asparagine synthetase, two 
from the gene for blue copper binding protein and two from glutathionine S-transferase (GST) 
cluster with PR1. The standard deviations for differences in expression of the 30 members of 
the PR1 cluster were smaller than one unit of differential gene expression (compared to 2.5 
units cut-off for our definition of significant alterations) under all 18 conditions. Thus, genes 
in one cluster show highly reproducible expression in our analysis. 

To help explain the molecular basis of this co-regulation, we analyzed the promoters of 
those eight PR1 cluster ESTs encoded within sequenced genomic regions. Strikingly, these 
genes all shared a common TTGAC consensus cis-element known to bind proteins of the 
WRKY zinc finger transcription factor binding family, often in multiple copies. In control 
promoters of eight ESTs selected randomly among the 634 non-PRl-like ESTs, the element 
TTGAC was only found in the statistically expected frequency. The WRKY element was not 
present in three of the randomly selected promoters, in three promoters once, in one promoter 
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twice, and in one promoter three times. WRKY elements are sufficient for defense gene 
transcriptional induction in different plant systems (Lebel et ai, 1998). A functional analysis 
of the BTH-induction of the PR1 promoter revealed two elements sharing all or four out of 
five consensus WRKY sites at -676 and - 638 (Lebel et al y 1998). The former acts as a 
repressor and the latter as a SA responsive element. It is therefore likely that WRKY 
transcription factors are responsible for the common induction pattern of the eight genes listed 
below in Table 19, and possibly also for the remaining 22 ESTs of the PR1 gene cluster for 
which no genomic sequence is presently available. 

We did not identify another common consensus sequence in the eight PR1 cluster 
promoters analyzed. The NFkB binding site found in the PR1 promoter which is required for 
full BTH-induction is not conserved among the eight promoters within the PR1 regulon. 

Table 19; Genes in the PRl-like cluster contain a common cis-element For the 26 ESTs 
clustering with PR1, genomic sequence was obtained, if available. 1 kb fragments upstream 
of the EST end or the predicted gene (if annotated) were scanned for common promoter 
elements. Numbering of the upstream sequences starts at the 5' end of the longest available 



EST, except for the PR1 promoter, for which primer extension has been performed (Lebel et 
al P 1998). 



EST 


homology 


cis element 


position 


Genomic sequence of EST or 
gene 


cDNA 


PR1 


ATTGAC 
TATGAC 
ATTGAC 
CTTGAC 
ATTGAC 


-676 to -671 
-638 to -643 
-544 to -539 
-492 to -487 
-181 to -176 


AF096294 (PR1 promoter) 


134C20T7 


No homology 


ATTGAC 
TTTGAC 
TTTGAC 


-799 to -794 
-735 to -731 
-335 to -330 


ABO17065, 16134- 15793 


192K7T7 


No homology 


TTTGAC 


-841 to -836 


AC002330, 99468-99711 


147P23T7 


Myrosinase binding 
protein 


TTTGAC 


-857 to-852 


AC001645, 7088 -7298 


245M18T7 


Tyrosine 
aminotransferase 


TTTTGAC 
CTTTGx\C 


-653 to -648 
ino m\a 


AC0O6585, 81840 -83922 


121I3T7 


No homology 


GTTGAC 
TTTGAC 


-996 to -991 
-792 to -787 


ABOH485, 48839-48492 


179C14T7 


7-ethoxy-coumarin- 
O-deethylase 


ATTGAC 
TTTGAC 
CTTGAC 
TTTGAC 


-892 to -887 
-813 to -808 
-745 to -740 
-18 to -13 


AC003680, 33897 - 33546 



- 116- 



WO 00/65039 



PCT/USOO/11460 



184F12T7 


Late embryonic 
abundant protein 
(SAG21) 


TTTGAC 
CTTGAC 
TTTGAC 
TTTGAC 


-847 to -842 
-671 to -666 
-117 to -112 
-36 to -31 


AF069298, 63830 - 64550 


WRKY 

consensus 




TTGAC 




Dorey et al, 1998 



In addition to identification of common regulatory elements, cluster analysis of 
expression profiles provides a tool to derive physiological functions of genes. This is 
important for sequences with no close homologs in the databank (for example EST 
134C20T7 or EST 192 K7T7) and also for genes with structural similarity to genes with 
known function (such as asparagine synthetase). We anchored the results of our statistical 
analysis first to those genes that were known to be co-regulated with PR1 . Interestingly, 
several genes in the PR1 cluster encode proteins involved in redox regulation. Although 
reactive oxygen intermediates do not likely act as secondary messengers in SAR downstream 
of SA accumulation, they are clearly implicated in defense response, either as local second 
messenger molecules, as direct cellular and microbial toxins or in cell-wall cross-linking and 
other oxidative processes (Alvarez et al> 1998). From the several known Arabidopsis 
peroxidases, peroxidase C seems to be specifically induced under plant defense conditions, as 
are some GSTs. Supporting this is the previous demonstration that both peroxidase C and 
GST are transcriptionally activated in a superoxide-dependent manner in the Arabidopsis 
mutant Isdl, which lacks the ability to halt hypersensitive-response-like lesions (Jabs et al, 
1996; Dietrich et al t 1994). Likewise, the blue-copper protein might be involved in the 
regulation of redox stages during SAR or might be the chelator of free cellular copper used for 
Cu/Zn SOD. Cu/Zn SOD is suppressed during SAR (Fodor et al, 1997). 

It has been proposed that plant metabolism adapts to pathogen attack (Batz et al, 1998). 
We can now more precisely identify such changes and investigate their importance in disease 
resistance. For example, it is instructive to speculate about the possible role of the 
coregulation of glutamine-dependent asparagine synthetase with the PR1 cluster during SAR. 
An increased flux of carbon through the phenylpropanoid pathway (lower overall N:C ratio) 
might liberate nitrogen, thus creating a demand for asparagine synthetase as a detoxifying 
enzyme. Similarly, pathogen infection increases the level of free ammonia in the plant, 
following an increased metabolism of proteins (Batz et al, 1998). Asparagine synthetase can 
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also utilize ammonia as a substrate. Hence, our observation that asparagine synthetase is co- 
regulated with PR1 focuses attention on physiological changes during SAR that might not be 
directly linked to defense. Alternatively, the induction of asparagine synthetase might be 
related to the induction of tyrosine transaminase. This is also the first time that the production 
of toxic mustard oils, released by myrosinase from glucosinolates, can be correlated to SAR in 
Arabidopsis. 

Our analysis describes the first map of the plant defense transcriptome during SAR in 
Arabidopsis. Using cluster analysis of DNA microarray data, significant changes in gene 
induction can be differentiated from random correlations. The large number of data points 
filters out coincidental co-induction and enhances the resolution and significance of serial 
analysis of expression profiles. It is clear that the description of complex signaling networks 
can only be obtained by investigating multiple inductive conditions, and not, as before, by 
pair- wise comparisons. Thus, similar analysis of additional mutants and inducing conditions 
will further enlarge the complexity and refine the resolution of the entire Arabidopsis 
transcriptome. 

7. Using Expression Profiling To Identify Inducible Promoters 

By comparing gene expression changes across various treatments, groups of co- 
regulated genes (regulons) were identified and the genomic sequences of genes within the 
regulons were examined to identify common sequence motifs likely to act as regulatory 
elements. These regulatory elements may then be used to make promoters that drive 
controlled gene expression. As described above, gene expression profiling using DNA 
microarrays was used to identify groups of genes that show a similar pattern of expression in 
response to biotic and abiotic stimuli, especially the 18 biotic and abiotic inducers of SAR 
described above in Table 18. Expression profile data uncovered groups of genes responsive to 
exogenous factors but not endogenous signals. 

Using PCR select and microarray gene chip technology described above, genes were 
identified that are responsive to BTH and/or pathogens. Furthermore, we compared 
expression profiles in response to biotic and abiotic inducers of SAR and assessed the 
requirement for salicylic acid and the NIM1 gene for rnRNA accumulation. By analysis of data 
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generated with cDNA microarrays, sets of genes that are responsive specifically to exogenous 
application of BTH were identified (see Table 20 below). The discovery of a set of BTH- 
inducible genes that are not responsive to SA or pathogens was especially surprising given the 
expectation from previous studies of the S AR signal transduction pathway that BTH would 
always act as a functional analog of SA and would also activate the SAR response in the same 
manner as pathogen infection. The regulatory regions from these differentially expressed 
genes can be isolated using conventional cloning techniques and used as specifically inducible 
promoters;, such as BTH-specific promoters. 

Table 20: Induced genes based on >5x BTH induction in a wild-type Arabidopsis line at 
4, 24, and 48 hours post-BTH and in the niml-4 mutant Arabidopsis line at 4 and 24 hours 
post-BTH. These are candidate genes for BTH-specific promoters that are induced by BTH 
but not SA or pathogens. Genes below the double line are induced by BTH in the niml-4 
mutant, but are not induced by SA or pathogen. Gene names (EST IDs) in bold are induced 
more than 5x by BTH and are also induced in the niml-4 mutant line by BTH (NIM1 
independent genes), x: not in cluster analysis; # indicates cluster analysis result. Numbers 
indicate fold-induction relative to control treatment. If no number is present, fold-induction is 
less than 2.5. 



Gene Name 
(EST ID) 


Cluster 


Sequence 
verified 


Wildtype 
BTH 4h 


Wildtype 
BTH 24h 


Wildtype 
BTH 48h 


niml-4 
BTH 4h 


niml-4 
BTH 24h 


H2D10T7 


x 






6.5 


5.6 






91B11T7 


x 






14.9 


5.9 






90N23T7 


X 










6.6 




241N24T7 


X 




7.7 






5.7 




240N6T7 


X 




5.3 










229F6T7 


X 




7.5 






8.4 




223M15T7 


X 




5.2 






7.3 




194G1T7 


3 


Y 








6.5 




190M3T7 


X 




5.4 










190C24T7 


3 


ACO02333 




5.1 








177C23T7 


4 


P48980 




5.2 


5.8 






175H5T7 


9 


Y 




5.8 








168A5T7 


X 








5,9 






167I19T7 


8 


Y 




5.2 








166P4T7 


8 


Y 




6.1 








166N10T7 


4 


Y 






5.1 






163L17T7 


X 






5.2 


6.1 
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162J11T7 


0 


Y 


*7 A 

1 A 










15902T7 


8 


ZA9697 




5.1 








159016T7 


8 


Y 




6.5 








158A15T7 


X 




5.3 










141D8T7 


X 














134G3T7 


X 








6.1 






134F18T7 


X 








8 






126N11T7 


10 


xt 
Y 






7.1 






125H21T7 


8 


Y 




6.7 








124A5T7 


4 


103D24T7 






6.4 






122F9T7 


X 






6.4 


8.5 






122B7T7 


8 


Y 






6.8 






122B24T7 


4 


Y 






5.6 






122B23T7 


8 


ACOo4484 




5.7 


6.3 






122A22T7 


8 


121J12T7 




6.9 








121H22T7 


X 






10.2 








121G2T7 


7 


Y 






5.8 






121G21T7 


8 


Y 




5.1 








121B21T7 


8 


Y 






5.6 






119N23T7 


X 






9 








118L22T7 


X 






12.4 


6.6 






118K13T7 


X 






8.3 








H7P7T7 


X 






11.3 








117P14T7 


8 


118L7T7 




8.6 


7.8 






117P13T7 


8 


XT' 

Y 




9.8 


5.2 








X 














1 1 TM7 1 T7 


A 














1 1 /vJiZ I / 


o 
5 


V 




^ 7 








1 1 7P 1 T7 
1 1 /HI X / 


X 






Q 








1 17"P1 RT7 


y 


1 1 OPw J J. / 




Ui J 








1 1 7n^T7 


X 














1 1 TTY> 1 T7 

11/ XJZ 11/ 


X 


















v 






^ 7 






90N23T7 


X 




5.1 


2.3 


1.2 


o.o 




241N24T7 


X 




mm mm 

1.1 


l.l 


1 a 
-1.6 


5./ 


2.8 


229F6T7 


X 




7.5 


2.4 


1.1 






228G7T7 


3 


y 


A 

4 


7 A 

2.4 


J. 2 


7 7 

2.2 


3.o 


223M15T7 


X 




5-2 


7 a 
2.3 


1.3 


/.3 


3.3 


222N23T7 


X 




4 


2.4 


2.2 


2.1 


3.8 




n 
/ 






^ i 

-j. jl 




i i 

1 . X 


14 


202O18T7 


X 




3.1 


-1 


-1.2 


4,3 


1.6 


194G1T7 


3 


Y 


3.7 


A 2 


-2,6 


6.5 


1.4 


191H15T7 


X 




3.1 


2 


1.6 


1.3 


3.2 


177H9T7 


X 




3.9 


2.3 


1.9 


2.1 


3.8 


172I13T7 


X 




3.2 


2 


-1.4 


1.9 


5 


162J3T7 


7 


Y 


3.8 


-1.9 


2.1 


1.5 


4.8 


162J11T7 

(N -gene) 


0 


Y 


7.4 


-U 


5 


1.2 


3.7 
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155E20T7 


X 




3 


3.1 


2.3 


1.1 


4.2 


125K7T7 


X 




4.8 


1.2 


-1.4 


3.1 


1.6 


109K9T7 


X 




-7.8 


2.3 


3,5 


-10.3 


5 


107A4T7 


X 




3.9 


-1.3 


-2.3 


3.7 


1.7 
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What is claimed is: 

1 . A method for isolating a regulatory DNA sequence from a differentially expressed gene, 
comprising: 

(a) obtaining an expression profile of a genome under two or more different conditions; 

(b) comparing gene induction and repression patterns among each of the different 
conditions; 

(c) identifying a gene that is differentially expressed among the different conditions; 

(d) obtaining the sequence of genomic DNA comprising the gene identified as being 
differentially expressed and regulatory elements associated with said gene; and 

(e) isolating a regulatory DNA sequence associated with the differentially expressed gene 
from the genomic DNA. 

2. A method for isolating a common regulatory DNA sequence from a group of co- 
regulated genes, comprising: 

(a) obtaining an expression profile of a genome under two or more different conditions; 

(b) comparing gene induction and repression patterns among each of the different 
conditions; 

(c) identifying a group of genes that are co-regulated with respect to each other but that are 
differentially expressed with respect to the rest of the genome among the different conditions; 

(d) obtaining the sequence of genomic DNA comprising the genes identified as being 
differentially expressed and regulatory elements associated with said genes; 

(e) identifying common regulatory DNA sequences among the genomic DNA associated 
with the co-regulated genes; and 

(f) isolating a common regulatory DNA sequence associated with the differentially 
expressed genes from the genomic DNA, 

3. A method according to claim 1 or claim 2, wherein said regulatory DNA sequence is a 
promoter. 

4. A method according to claim 1 or claim 2, wherein said genome is a plant genome. 
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5. A method according to claim 1 or claim 2, wherein said expression profile is obtained 
using a DNA microarray. 

6. A method according to claim 1 or claim 2, wherein said two or more different 
conditions comprise biotic stimuli, 

7. A method according to claim 1 or claim 2, wherein said two or more different 
conditions comprise abiotic stimuli. 

8. A method according to claim 1 or claim 2, wherein said two or more different 
conditions comprise biotic and abiotic stimuli. 

9. A method according to claim 1 or claim 2, wherein said genome is a plant genome and 
wherein said two or more different conditions comprise at least one SAR inducing condition 
or at least one SAR repressing condition. 

10. A method according to claim 9, wherein said least one SAR inducing condition 
comprises pathogen infection, S A application, BTH application, NIM1 gene expression, or a 
cim mutation. 

11. A method according to claim 9, wherein said least one SAR repressing condition 
comprises NahG expression or a niml mutation. 

12. A method for isolating a regulatory DNA sequence from a differentially expressed plant 
gene, comprising: 

(a) using DNA microarray technology to obtain an expression profile of a plant genome 
under two or more different conditions, wherein said conditions comprise at least one SAR- 
inducing or SAR-repressing condition; 

(b) comparing gene induction and repression patterns among each of the different 
conditions; 
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(c) identifying a gene that is differentially expressed among the different conditions; 

(d) obtaining the sequence of genomic DNA comprising the gene identified as being 
differentially expressed and regulatory elements associated with said gene; and 

(e) isolating a regulatory DNA sequence associated with the differentially expressed gene 
from the genomic DNA. 

13. A method for isolating a common regulatory DNA sequence from a group of co- 
regulated plant genes, comprising: 

(a) using DNA microarray technology to obtain an expression profile of a genome under 
two or more different conditions, wherein said conditions comprise at least one S AR-inducing 
or SAR-repressing condition; 

(b) comparing gene induction and repression patterns among each of the different 
conditions; 

(c) identifying a group of genes that are co-regulated with respect to each other but that are 
differentially expressed with respect to the rest of the genome among the different conditions; 

(d) obtaining the sequence of genomic DNA comprising the genes identified as being 
differentially expressed and regulatory elements associated with said genes; 

(e) identifying common regulatory DNA sequences among the genomic DNA associated 
with the co-regulated genes; and 

(f) isolating a common regulatory DNA sequence associated with the differentially 
expressed genes from the genomic DNA. 

14. A method according to claim 12 or claim 13, wherein said regulatory DNA sequence is 
a promoter. 

15. A method according to claim 12 or claim 13, wherein said least one SAR inducing 
condition comprises pathogen infection, S A application, BTH application, NIM1 gene 
expression, or a dm mutation. 

16. A method according to claim 12 or claim 13, wherein said least one SAR repressing 
condition comprises NahG expression or a niml mutation. 
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17. A method according to claim 12 or claim 13, wherein said two or more different 
conditions comprise BTH application, and at least one condition selected from the group 
consisting of pathogen infection, SA application, and a niml mutation. 

1 8. A method according to claim 17, wherein said two or more different conditions 
comprise BTH application, pathogen infection, and S A application, and wherein one or more 
genes are identified that are inducible by BTH application but not by pathogen infection or S A 
application. 

19. A method according to claim 17, wherein said two or more different conditions 
comprise BTH application and a niml mutation, and wherein one or more genes are identified 
that are inducible by BTH application in niml mutant plants. 

20. A method according to claim 18 or claim 19, wherein said one or more genes are 
inducible at least 5x by BTH application. 



-151- 



WO 00/65039 



SEQUENCE LISTING 

<110> Maleck, Klaus 
Lawton , Kay 
Dietrich, Robert 

<120> IDENTIFICATION OF DNA CONTROL ELEMENTS RESPONSIVE 
SPECIFIC STIMULI 

<130> A-30901A 

<140> 
<141> 

<160> 8 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide primer 

<400> 1 

taatacgact cactataggg 



<210> 2 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide primer 

<400> 2 

atttaggtga cactatagga 



<210> 3 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide primer 

<400> 3 

ttcccaacag ttgcgcagc 



<210> 4 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: 
oligonucleotide primer 

<400> 4 

tcttcgctat tacgccagct 



<210> 5 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: 
oligonucleotide primer 

<4O0> 5 

tcacacagga aacagctat 



<210> 6 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide primer 

<4O0> 6 

acacaacata cgagccggaa 

<210> 7 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 
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